iMeDIA Lab | Intelligent Medical and Document Image Analytics

About Our Lab

Our Mission

At the iMeDIA lab, we develop robust, scalable, and intelligent AI solutions. Our primary domains include analyzing complex medical imaging data for computer-aided diagnosis, video anomaly detection, and extracting structured information from unstructured document images using OCR and layout analysis.

We collaborate with clinical experts and industry partners to bridge the gap between theoretical machine learning and real-world application.

Core Technologies

Deep Learning & Neural Networks
Computer Vision
Medical Image Segmentation & Classification
Optical Character Recognition (OCR)
Document Layout Analysis
Natural Language Processing (NLP) integration
Object detection & tracking
Video Understanding

Research Areas

Medical Image Diagnostics

Developing AI models to analyze retinal fundus images and infrared dryeye images to assist radiologists in early detection of anomalies and diseases.

Document AI

Creating algorithms capable of reading, understanding, and structuring data from scanned documents, historical archives, and handwritten notes.

Multimodal Learning

Combining visual data (images) with textual data (clinical reports or document text) to create comprehensive AI reasoning systems.

Deep Learning & Generative Models

Leveraging neural networks to learn powerful data representations. Generating realistic data using GANs, VAEs, and diffusion-based models.

Video Understanding

Analyzing spatial and temporal patterns in video data. Enabling tasks like action recognition, event or anomaly detection, and scene interpretation.

Object Detection and Tracking

Detecting and localizing objects in images and videos with high precision. Tracking their movement across frames for real-time and intelligent analysis.

News & Announcements

May 5, 2026

News

Our Research Featured in Dainik Jagran

We are pleased to announce that our recent research on deep learning based dry eye segmentation has been featured in …

April 11, 2026

Open Seminar

Pre-thesis Submission (Open) Seminar of Mr. Suvramalya Basak (RSI2022003)

Pre-thesis Submission (Open) Seminar of Mr. Suvramalya Basak (RSI2022003) will be held on 23 April 2026 at 11:30 AM in …

Dec. 23, 2025

Award

Best Paper Award at CICT 2025

Our paper, "DualDiffSeg: Dual-head Diffusion Probabilistic Model for Meibomian Glands Segmentation", received the Best Paper Award at CICT 2025. Congratulations …

Dec. 2, 2024

Conference

Poster presentation at ICPR 2024

Paper presented at ICPR 2024 by Suvramalya Basak. The work was titled "Multi-teacher Importance Preserving Knowledge Distillation for Early Violence …

View All Announcements →

Recent Publications

2026 DIFEM: Key-points Interaction based Feature Extraction Module for Violence Recognition in Videos Himanshu Mittal, Suvramalya Basak, Anjali Gautam Signal, Image and Video Processing
View Paper / PDF

2025 DualDiffSeg: Dual-head Diffusion Probabilistic Model for Meibomian Glands Segmentation Ankit Kumar Verma, Anjali Gautam IEEE 9th International Conference on Information and Communication Technology (CICT). IEEE, 2025.
View Paper / PDF

2024 Diffusion-based normality pre-training for weakly supervised video anomaly detection Suvramalya Basak, Anjali Gautam Expert Systems with Applications
View Paper / PDF

2024 Multi-teacher Importance Preserving Knowledge Distillation for Early Violence Prediction Suvramalya Basak, Aditya Vaishy, Anjali Gautam 27th International Conference on Pattern Recognition (ICPR)
View Paper / PDF

View All Publications →

Our Team

Professors

Dr. Mohammad Javed

Associate Professor

javed@iiita.ac.in

Specializes in Document Image Analysis, Handwriting Recognition, and Deep Learning applications.

Dr. Anjali Gautam

Assistant Professor

anjaligautam@iiita.ac.in

Specializes in Medical Image Analysis and Deep Learning applications.

Students

Suvramalya Basak

Research Scholar

rsi2022003@iiita.ac.in

Video Anomaly Detection

Apurba Chakraborty

Research Scholar

rsi2024005@iiita.ac.in

Document Image Analysis

Ankit Kumar Verma

Research Scholar

rsi2024501@iiita.ac.in

Retinal Image Analysis

Amirtansh Maurya

Research Scholar

rsi2024503@iiita.ac.in

Document Analysis

Aarti Jha

Research Scholar

rsi2025509@iiita.ac.in

Generative AI

Arbiya Sabri

Research Scholar

rsi2026007@iiita.ac.in

Generative AI

Vivek Vishwakarma

Junior Research Assistant

prf.vivek@iiita.ac.in

Alt Text Generation

About Our Lab

Our Mission

Core Technologies

Research Areas

Medical Image Diagnostics

Medical Image Diagnostics

Document AI

Document AI

Multimodal Learning

Multimodal Learning

Deep Learning & Generative Models

Deep Learning & Generative Models

Video Understanding

Video Understanding

Object Detection and Tracking

Object Detection and Tracking

News & Announcements

Our Research Featured in Dainik Jagran

Pre-thesis Submission (Open) Seminar of Mr. Suvramalya Basak (RSI2022003)

Best Paper Award at CICT 2025

Poster presentation at ICPR 2024

Recent Publications

Our Team

Professors

Dr. Mohammad Javed

Dr. Anjali Gautam

Students

Suvramalya Basak

Apurba Chakraborty

Ankit Kumar Verma

Amirtansh Maurya

Aarti Jha

Arbiya Sabri

Vivek Vishwakarma

Contact Us