My research focuses on Computer Vision, Generative AI, Large Language Models (LLMs), 3D Scene Understanding, and Machine Learning, with applications in intelligent visual systems, healthcare AI, and immersive technologies.
I develop scalable, interpretable, and high-performance AI models across the following areas:
- 3D Vision and Scene Understanding – point cloud processing, neural radiance fields (NeRF), and Gaussian splatting for photorealistic 3D reconstruction and rendering.
- Computer Vision – image understanding, object detection, semantic segmentation, and depth estimation for spatial intelligence.
- Multimodal AI and Vision-Language Models – integration of vision and language models (e.g. CLIP, BLIP, Flamingo, and LLM-based frameworks) for cross-modal reasoning and grounded AI systems.
- Large Language Models (LLMs) – efficient fine-tuning, multimodal extension, and integration with vision systems for reasoning, automation, and intelligent agents.
- Generative Adversarial Networks (GANs) – image synthesis, style transfer, data augmentation, and domain adaptation, including applications in medical imaging.
- Diffusion Models – next-generation generative architectures for high-fidelity image, video, and 3D content generation.
This work aims to advance the state of the art in Computer Vision, Generative AI, and LLM-driven intelligent systems, with a focus on real-world deployment and scalable AI solutions.
My research uses frameworks such as PyTorch and TensorFlow together with CUDA-based GPU acceleration, supporting efficient experimentation and large-scale model development.
PhD Supervision
I supervise PhD research in Computer Vision, Generative AI, Large Language Models (LLMs), Multimodal AI, and 3D Scene Understanding.
I welcome enquiries from highly motivated candidates with a strong background in Artificial Intelligence, Machine Learning, Computer Science, or related disciplines.
Projects may be self-funded, externally funded (e.g. by government or industry), or collaborative, depending on alignment with ongoing research and on available funding opportunities, including UKRI (EPSRC) and Horizon Europe.
Prospective applicants are encouraged to make initial contact to discuss potential research directions.
I am a recognised PhD supervisor within the university’s Doctoral College, and my profile can be found by searching for my name in its doctoral supervisor directory. I particularly welcome enquiries from candidates interested in innovative research at the intersection of Generative AI, Computer Vision, and Large Language Models, especially in real-world and interactive systems.
Collaborations and Research Alignment
My research is well aligned with the funding and collaboration priorities of UKRI (EPSRC), Innovate UK, and Horizon Europe in Generative AI, Large Language Models, and data-driven intelligent systems. I welcome opportunities for academic and industry collaboration in these areas.
Research Areas and Expertise
Computer Vision, Generative AI, Machine Learning, Large Language Models (LLMs), Multimodal AI, 3D Vision, Diffusion Models, GANs, Neural Radiance Fields (NeRF), Gaussian Splatting, Vision-Language Models, AI for Healthcare