Siddharth Parekh

Hi! I’m Siddharth, a student at Carnegie Mellon University where I’m completing my Fifth-Year Master’s in Computer Science. My current research interests lie in factuality and how knowledge is learned and represented within Large Language Models.
I’ve worked extensively with Professor Carolyn Rosé’s Document Understanding Group at CMU’s Language Technologies Institute - collaborating with Armineh Nourbakhsh on developing graph-based models for form processing, and robust evaluation metrics for document visual question answering.
Zooming out, I often find myself digging into the weeds of deep neural networks hoping to uncover something interesting.
Feel free to reach out to me to chat about my research or any shared interests!
news
Aug 25, 2025 | I’m starting my Fifth-Year Master’s at CMU! |
---|---|
May 08, 2025 | I graduated from CMU with University and SCS College Honours! |
Jan 22, 2025 | Our work, Where is this coming from? Making groundedness count in the evaluation of Document VQA models, has been accepted to NAACL 2025 Findings! |
Sep 20, 2024 | Our work, AliGATr: Graph-based layout generation for form understanding, has been accepted to EMNLP 2024 Findings! |
selected publications
- Where is this coming from? Making groundedness count in the evaluation of Document VQA modelsIn Findings of the Association for Computational Linguistics: NAACL 2025, Apr 2025