pic_mohbat.jpg

Fnu Mohbat

Ph.D. Candidate

*** open for new opportunities ***

I am a Ph.D. Candidate in Computer Science at Rensselaer Polytechnic Institute, advised by Prof. Mohammad J. Zaki. My research focuses on improving Large Language Models (LLMs) and Multi-Modal Models (MMMs) by integrating Knowledge Graphs (KGs) through Retrieval-Augmented Generation (RAG), with applications in food computing, document understanding, and text generation. I have collaborated with researchers at IBM Research, where I worked on visually rich document understanding and bias in multi-modal LLMs.

news

Education

  • 2021.01 - 2025.05

    Troy, NY, USA

    Doctor of Philosophy
    Rensselaer Polytechnic Institute
    Computer Science
  • 2016.08 - 2018.06

    Lahore, Pakistan

    Master of Science
    Lahore University of Management Sciences
    Electrical Engineering
  • 2012.08 - 2013.05

    Fort Wayne, IN, USA

    Undergraduate Exchange
    Ivy Tech Community College
    Engineering
  • 2009.09 - 2014.07

    Lahore, Pakistan

    Bachelor of Science
    COMSATS Institute of Information Technology
    Computer Science

Research experience

  • 2023.05 - 2023.08

    Remote

    Summer Extern
    IBM Thomas J. Watson Research Center
    Investigated the impact of stable diffusion techniques on manipulating visual concepts within MMMs to produce targeted text such as stories and summaries.
  • 2022.05 - 2022.08

    Yotktown Heights, NY

    Summer Extern
    IBM Thomas J. Watson Research Center
    Improved generalizstion of visually rich document understanding model by 10-30% by modeling documents as graphs and learning their embeddings using Transformer and Graph Neural Networks (GNNs) for downstream tasks.
  • 2021.08 - Present

    Troy, NY, USA

    Graduate Research Assistant
    Rensselaer Polytechnic Institute
    Conducting research to enhance Large Language Models (LLMs) and Multi-Modal Models (MMMs) using Knowledge Graphs (KGs) and Retrieval-Augmented Generation (RAG) for food computing applications.
  • 2021.05 - 2021.08

    Remote

    Summer Extern
    IBM Thomas J. Watson Research Center
    Developed and trained transformer models and object detection models for document text understanding, focusing on classification and key-value prediction.
  • 2019.04 - 2020.12

    Lahore, Pakistan

    Research Associate in National Agriculture Robotics Lab
    Lahore University of Management Sciences
    Lead team of 5-10 researchers for developing IoT solutions for water quality monitoring, and improved model compression techniques by reducing model size by 10-30 times.

Current Research

  1. knowfm2025.png
    Knowledge Graph-Enhanced LLM for Food Recommendation through Question Answering
    Fnu Mohbat, and Mohammed J. Zaki
    Towards Knowledgeable Foundation Models (KnowFM) at AAAI, 2025
  2. model_llavachef.png
    LLaVA-Chef: A Multi-modal Generative Model for Food Recipes
    Fnu Mohbat, and Mohammed J. Zaki
    Proceedings of the 33rd ACM International Conference on Information and Knowledge Management (CIKM), 2024
  3. trustnlp_24.png
    Beyond Visual Augmentation: Investigating Bias in Multi-Modal Text Generation
    Fnu Mohbat, Vijay Sadashivaiah, Keerthiram Murugesan , and 3 more authors
    Fourth Workshop on Trustworthy Natural Language Processing (TrustNLP) at NAACL, 2024
  4. gvdoc.png
    GVdoc - Graph-based Visual DOcument Classification
    Fnu Mohbat, Mohammed J. Zaki, Catherine Finegan-Dollak , and 1 more author
    Findings of the Association for Computational Linguistics, 2023
  5. bmvc_21.png
    Teacher-Class Network: A Neural Network Compression Mechanism
    Shaiq Munir Malik, Muhammad Umair Haider, Mohbat Tharani , and 2 more authors
    The 32nd British Machine Vision Conference (BMVC), 2021
  6. iconip_21.png
    Trash Detection on Water Channels
    Mohbat Tharani, Abdul Wahab Amin, Fezan Rasool , and 3 more authors
    International Conference on Neural Information Processing, 2021