Updates
Apr 25: 8th One-week short term course on Advances in Deep Architectures for Signal, Image and Vision Applications (ADASIVA 2026) 15-19 June 2026 | Learn More

iMeDIA Lab

Intelligent Medical and Document Image Analytics

Advancing healthcare and information extraction through state-of-the-art computer vision and deep learning.

Explore Our Research

About Our Lab

Our Mission

At the iMeDIA lab, we develop robust, scalable, and intelligent AI solutions. Our primary domains include analyzing complex medical imaging data for computer-aided diagnosis, video anomaly detection, and extracting structured information from unstructured document images using OCR and layout analysis.


We collaborate with clinical experts and industry partners to bridge the gap between theoretical machine learning and real-world application.

Core Technologies

  • Deep Learning & Neural Networks
  • Computer Vision
  • Medical Image Segmentation & Classification
  • Optical Character Recognition (OCR)
  • Document Layout Analysis
  • Natural Language Processing (NLP) integration
  • Object detection & tracking
  • Video Understanding

Research Areas

Medical Image Diagnostics

Medical Image Diagnostics

Developing AI models to analyze retinal fundus images and infrared dryeye images to assist radiologists in early detection of anomalies and diseases.

Document AI

Document AI

Creating algorithms capable of reading, understanding, and structuring data from scanned documents, historical archives, and handwritten notes.

Multimodal Learning

Multimodal Learning

Combining visual data (images) with textual data (clinical reports or document text) to create comprehensive AI reasoning systems.

Deep Learning & Generative Models

Deep Learning & Generative Models

Leveraging neural networks to learn powerful data representations. Generating realistic data using GANs, VAEs, and diffusion-based models.

Video Understanding

Video Understanding

Analyzing spatial and temporal patterns in video data. Enabling tasks like action recognition, event or anomaly detection, and scene interpretation.

Object Detection and Tracking

Object Detection and Tracking

Detecting and localizing objects in images and videos with high precision. Tracking their movement across frames for real-time and intelligent analysis.

News & Announcements

May 5, 2026
News

Our Research Featured in Dainik Jagran

We are pleased to announce that our recent research on deep learning based dry eye segmentation has been featured in …

April 11, 2026
Open Seminar

Pre-thesis Submission (Open) Seminar of Mr. Suvramalya Basak (RSI2022003)

Pre-thesis Submission (Open) Seminar of Mr. Suvramalya Basak (RSI2022003) will be held on 23 April 2026 at 11:30 AM in …

Dec. 23, 2025
Award

Best Paper Award at CICT 2025

Our paper, "DualDiffSeg: Dual-head Diffusion Probabilistic Model for Meibomian Glands Segmentation", received the Best Paper Award at CICT 2025. Congratulations …

Dec. 2, 2024
Conference

Poster presentation at ICPR 2024

Paper presented at ICPR 2024 by Suvramalya Basak. The work was titled "Multi-teacher Importance Preserving Knowledge Distillation for Early Violence …

Recent Publications

2026 DIFEM: Key-points Interaction based Feature Extraction Module for Violence Recognition in Videos Himanshu Mittal, Suvramalya Basak, Anjali Gautam Signal, Image and Video Processing
View Paper / PDF
2025 DualDiffSeg: Dual-head Diffusion Probabilistic Model for Meibomian Glands Segmentation Ankit Kumar Verma, Anjali Gautam IEEE 9th International Conference on Information and Communication Technology (CICT). IEEE, 2025.
View Paper / PDF
2024 Diffusion-based normality pre-training for weakly supervised video anomaly detection Suvramalya Basak, Anjali Gautam Expert Systems with Applications
View Paper / PDF
2024 Multi-teacher Importance Preserving Knowledge Distillation for Early Violence Prediction Suvramalya Basak, Aditya Vaishy, Anjali Gautam 27th International Conference on Pattern Recognition (ICPR)
View Paper / PDF

Our Team

Professors

Dr. Mohammad Javed

Dr. Mohammad Javed

Associate Professor

javed@iiita.ac.in

Specializes in Document Image Analysis, Handwriting Recognition, and Deep Learning applications.

Dr. Anjali Gautam

Dr. Anjali Gautam

Assistant Professor

anjaligautam@iiita.ac.in

Specializes in Medical Image Analysis and Deep Learning applications.

Students

Suvramalya Basak

Suvramalya Basak

Research Scholar

rsi2022003@iiita.ac.in

Video Anomaly Detection

Apurba Chakraborty

Apurba Chakraborty

Research Scholar

rsi2024005@iiita.ac.in

Document Image Analysis

Ankit Kumar Verma

Ankit Kumar Verma

Research Scholar

rsi2024501@iiita.ac.in

Retinal Image Analysis

Amirtansh Maurya

Amirtansh Maurya

Research Scholar

rsi2024503@iiita.ac.in

Document Analysis

Aarti Jha

Aarti Jha

Research Scholar

rsi2025509@iiita.ac.in

Generative AI

Arbiya Sabri

Arbiya Sabri

Research Scholar

rsi2026007@iiita.ac.in

Generative AI

Vivek Vishwakarma

Vivek Vishwakarma

Junior Research Assistant

prf.vivek@iiita.ac.in

Alt Text Generation

Contact Us

iMeDIA Lab

Address: Room 5402 Computer Center-III (CC3)
Indian Institute of Information Technology Allahabad, Uttar Pradesh 211015, India

Email: imedia.iiita@gmail.com