I am currently pursuing a Master's degree in Computer Science and Statistics at Harvard University with a specialization in Artificial Intelligence. I am also working as a Graduate Research Assistant in the Multisensory Intelligence Research Group at MIT Media Lab, under the supervision of Prof. Paul Liang. Before that, I was a Graduate Research Assistant at Harvard’s Data and Knowledge Exploration Lab (DTaK), where I was co-supervised by Prof. Finale Doshi-Velez and Prof. Wei Wei Pan. Previously, I worked as a Research Fellow/Engineer at Microsoft Research India (R&D) where I worked with Prof. Monojit Choudhury, Dr. Subhojit Som, Vishrav Chaudhury, Dr. Sunayana Sitaraman, and Dr. Saurabh Tiwary as part of the Turing team.

My research focuses on understanding the limitations and capabilities of contemporary AI systems, such as Large Language Models (LLMs), through mechanistic interpretability and model behavior control techniques like model editing and representation engineering. I am particularly interested in the mechanisms behind LLM behaviors and the democratization of these models by ensuring they are secure and unbiased without extensive fine-tuning. My work at Microsoft involved training mid-sized language models like Phi and Turing Bletchely in multilingual and multimodal settings respectively and integrating them into products like Bing Search and Microsoft Copilot.

Keywords: LLMs, model interpretability, relational compositionality, activation steering, SafeAI, multimodal learning, bias mitigation, multilingual learning

I graduated with a Bachelor's degree in Electrical Engineering, and minor in Computer Science, from IIT Kharagpur in 2022. For more detailed information about my background, please refer to my CV. If you’d like to chat about my work or research interests, feel free to reach out!

Recent News
Feb, 2025

Joined Multisensory Intelligence Research Group at MIT Media Lab!

Aug, 2024

Joined Harvard University and DtaK Lab!

Oct, 2023
Sep, 2023

Taught a workshop course in Practical NLP and Large Language Models (LLMs) for bachelor's students at the Indian Institute of Science (IISc) as part of Kotak-IISc AI-ML Centre!

Jul, 2022

Joined Microsoft Research India as a Research Fellow.

Publications


sPhinX: Sample Efficient Multilingual Instruction Fine-Tuning Through N-shot Guided Prompting
Sanchit Ahuja*, Kumar Tanmay*, Hardik Hansrajbhai Chauhan, Barun Patra, Kriti Aggarwal, Luciano Del Corro, Arindam Mitra, Tejas Indulal Dhamecha, Ahmed Awadallah, Monojit Choudhary, Vishrav Chaudhary, Sunayana Sitaram
Under Review pdf


Do Moral Judgment and Reasoning Capability of LLMs Change with Language? A Study using the Multilingual Defining Issues Test
Aditi Khandelwal*, Utkarsh Agarwal*, Kumar Tanmay*, Monojit Choudhury
EACL 2024 Oral pdf


Ethical Reasoning and Moral Value Alignment of LLMs Depend on the Language we Prompt them in
Utkarsh Agarwal*, Kumar Tanmay*, Aditi Khandelwal*, Monojit Choudhury
LREC-COLING 2024 pdf


Probing the Moral Development of Large Language Models through Defining Issues Test
Kumar Tanmay*, Aditi Khandelwal*, Utkarsh Agarwal*, Monojit Choudhury
Workshop on AI meets Moral Philosophy and Moral Psychology (MP2) - Neurips 2023 pdf


Ethical Reasoning over Moral Alignment: A Case and Framework for In-Context Ethical Policies in LLMs
Abhinav Rao*, Aditi Khandelwal*, Kumar Tanmay*, Utkarsh Agarwal*, Monojit Choudhury
Findings of EMNLP 2023 pdfslidesvideoposter


DUBLIN: Visual Document Understanding By Language-Image Network
Kriti Aggarwal*, Aditi Khandelwal*, Kumar Tanmay*, Owais Mohammed Khan, Qiang Liu, Monojit Choudhury, Hardik Hansrajbhai Chauhan, Subhojit Som, Vishrav Chaudhary, Saurabh Tiwary
EMNLP Industry Track 2023 pdfslidesvideoposter

Efficient Poverty Mapping from High Resolution Remote Sensing Images
Kumar Ayush*, Burak Uzkent*, Kumar Tanmay, Marshall Burke, David Lobell, Stefano Ermon
AAAI 2021 Oral pdf

Efficient Conditional Pre-training for Transfer Learning
Shuvam Chakraborty, Kumar Ayush*, Burak Uzkent*, Kumar Tanmay, Evan Sheehan, Stefano Ermon
Workshop on Learning with Limited Labelled Data for Image and Video Understanding (L3D-IVU) - CVPR 2022 pdf

Geography-Aware Self-Supervised Learning
Kumar Ayush*, Burak Uzkent*, Chenlin Meng*, Kumar Tanmay, Marshall Burke, David Lobell, Stefano Ermon
ICCV 2021 pdf

Service
Reviewer  for ACL 2023, AACL-IJCNLP 2023, NAACL 2024, TACL 2024, EMNLP 2024, NAACL 2025.

  Template: Sebastin