Shwetha Somasundaram

prof_pic_final.png

I am currently working as a Research Associate II at the Multimodal Content Experiences Lab at Adobe Research. In the last 2.5 years I’ve primarily worked with Dr. Apoorv Saxena and Dr. Balaji Srinivasan on leveraging Large Language models (LLMs)/ Multimodal Large Language Models (MLLMs) for document experience projects for Adobe Acrobat and Adobe Express. I’ve worked a wide range of research areas: retrieval and attribution for document question answering, document stylization and transformation, graphic design generation and speculative decoding. I am currently interested and working on model merging techniques for LLMs/VLMs and using model internals for interpretibility.

I completed my bachelor’s thesis under the supervision of Prof. N Venkateswaran at SSN College of Engineering. My project focused on the road object detection from radar sensor data using machine learning and deep learning object detection techniques. During my undergraduate studies, I also explored the estimation of tracer kinetic parameters from undersampled DCE-MRI data, under the supervision of Dr. Phaneendra Yalavarthy at the Medical Imaging Lab, Indian Institute of Science, Bangalore.

If you’d like to know more about my work or discuss potential collaborations, please check out my CV. I’m always open to new opportunities and interesting conversations!

Publications

2024

  1. NAACL Findings 2025
    PLD+: Accelerating LLM inference by leveraging Language Model Artifacts
    Shwetha Somasundaram, Anirudh Phukan, and Apoorv Saxena
    arXiv preprint arXiv:2412.01447, 2024
  2. AAAI 2025
    PostDoc: Generating Poster from a Long Multimodal Document Using Deep Submodular Optimization
    Vijay Jaisankar, Sambaran Bandyopadhyay, Kalp Vyas, and 2 more authors
    arXiv preprint arXiv:2405.20213, 2024
  3. ACL Findings 2024
    Peering into the Mind of Language Models: An Approach for Attribution in Contextual Question Answering
    Anirudh Phukan, Shwetha Somasundaram, Apoorv Saxena, and 2 more authors
    In Findings of the Association for Computational Linguistics ACL 2024, 2024
  4. EACL Main 2024
    Presentations by the Humans and For the Humans: Harnessing LLMs for Generating Persona-Aware Slides from Documents
    Ishani Mondal, Shwetha S, Anandhavelu Natarajan, and 3 more authors
    In Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics (Volume 1: Long Papers), Mar 2024

2023

  1. EMNLP Findings 2023
    Drilling Down into the Discourse Structure with LLMs for Long Document Question Answering
    Inderjeet Nair*Shwetha Somasundaram*, Apoorv Saxena, and 1 more author
    In Findings of the Association for Computational Linguistics: EMNLP 2023, Mar 2023

Patents

  1. PLD+: Accelerating LLM inference by leveraging the hidden states of Language Models (US Patent App. 18/924,398)
  2. Evidence Retrieval for Long Document Question Answering Using Large Language Models (US Patent App. 18/508,437)
  3. Automatic generation of handouts from multi-modal documents (US Patent App. 18/542,161)
  4. Merging misidentified text structures in a document (US Patent App. 18/511,111)
  5. Generating targeted layouts from source documents utilizing large language models with semantic hierarchical transformations (US Patent App. 18/809,147)
  6. Generating a digital poster including multimodal content extracted from a source document
  7. Document-based presentation generation (US Patent App. 18/675,451)