Aug 20, 2025 |
Our paper, “A Systematic Analysis of Base Model Choice for Reward Modeling”, is accepted to EMNLP 2025.
|
May 19, 2025 |
Started my internship at Microsoft Turing working on improving the efficiency of reasoning language models.
|
Mar 31, 2025 |
I passed my qualifying exam and officially became a PhD Candidate.
|
Jan 22, 2025 |
Our paper, “A Practical Analysis of Human Alignment with *PO”, is accepted to NAACL 2025 Findings.
|
Sep 26, 2024 |
Our paper, “MARVEL: Multidimensional Abstraction and Reasoning through Visual Evaluation and Learning”, is accepted to NeurIPS 2024 Datasets and Benchmarks Track.
|
Jul 10, 2024 |
Our paper, “The Curious Case of Nonverbal Abstract Reasoning with Multi-Modal Large Language Models”, is accepted to COLM 2024.
|
May 13, 2024 |
Started my internship at Microsoft Turing working on preference optimization and reinforcement learning from human feedback (RLHF) for human alignment in large language models (LLMs).
|
Oct 07, 2023 |
Our paper, “Temporal Knowledge Graph Forecasting Without Knowledge Using In-Context Learning”, is accepted to EMNLP 2023.
|
May 15, 2023 |
Started my internship at Microsoft Turing working on extending the context length of large language models (LLMs) using a retrieval-augmented attention mechanism.
|