Siddharth Parekh

Hi! I’m Siddharth, a senior studying Computer Science at CMU. My current research interests lie at the intersection of natural language processing, machine learning, and game theory, with the goal of building intelligent systems using language agents.

I’ve worked extensively with Professor Carolyn Rosé’s Document Understanding Group at CMU’s Language Technologies Institute, collaborating with Armineh Nourbakhsh on graph-based models for form processing and on robust evaluation metrics for document visual question answering.

Zooming out, I often find myself at the confluence of mathematics and computer science, looking for innovative applications with strong theoretical foundations.

Feel free to reach out to me to chat about my research or any shared interests!

news

Jan 22, 2025 Our paper, “Where is this coming from? Making groundedness count in the evaluation of Document VQA models,” was accepted to NAACL 2025 Findings!
Sep 20, 2024 Our paper on form understanding, “AliGATr: Graph-based layout generation for form understanding,” was accepted to EMNLP 2024 Findings!

selected publications

  1. AliGATr: Graph-based layout generation for form understanding
    Armineh Nourbakhsh, Zhao Jin, Siddharth Parekh, and 2 more authors
    In Findings of the Association for Computational Linguistics: EMNLP 2024, Nov 2024
  2. Where is this coming from? Making groundedness count in the evaluation of Document VQA models
    Armineh Nourbakhsh, Siddharth Parekh, Pranav Shetty, and 3 more authors
    In Findings of the 2025 Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: NLP in a Multicultural World, Apr 2025