Shwetha Somasundaram

I am currently working as a Research Associate II at the Multimodal Content Experiences Lab at Adobe Research. In the last 2.5 years I’ve primarily worked with Dr. Apoorv Saxena and Dr. Balaji Srinivasan on leveraging Large Language models (LLMs)/ Multimodal Large Language Models (MLLMs) for document experience projects for Adobe Acrobat and Adobe Express. I’ve worked a wide range of research areas: retrieval and attribution for document question answering, document stylization and transformation, graphic design generation and speculative decoding. I am currently interested and working on model merging techniques for LLMs/VLMs and using model internals for interpretibility.
I completed my bachelor’s thesis under the supervision of Prof. N Venkateswaran at SSN College of Engineering. My project focused on the road object detection from radar sensor data using machine learning and deep learning object detection techniques. During my undergraduate studies, I also explored the estimation of tracer kinetic parameters from undersampled DCE-MRI data, under the supervision of Dr. Phaneendra Yalavarthy at the Medical Imaging Lab, Indian Institute of Science, Bangalore.
If you’d like to know more about my work or discuss potential collaborations, please check out my CV. I’m always open to new opportunities and interesting conversations!
Publications
2024
2023
Patents
- PLD+: Accelerating LLM inference by leveraging the hidden states of Language Models (US Patent App. 18/924,398)
- Evidence Retrieval for Long Document Question Answering Using Large Language Models (US Patent App. 18/508,437)
- Automatic generation of handouts from multi-modal documents (US Patent App. 18/542,161)
- Merging misidentified text structures in a document (US Patent App. 18/511,111)
- Generating targeted layouts from source documents utilizing large language models with semantic hierarchical transformations (US Patent App. 18/809,147)
- Generating a digital poster including multimodal content extracted from a source document
- Document-based presentation generation (US Patent App. 18/675,451)