Aditi Chaudhary

Hey! I am a Research Scientist at Google Research where I am exploring methods to generate synthetic data for training better models. I graduated with a Ph.D. in the Language and Information Technologies from the Language Technologies Institute of Carnegie Mellon University, where I was advised by Graham Neubig.

During my PhD I worked around developing methods/tools for automatically extracting language descritions such as grammar rules (morphology agreement, word order, case marking, lexical semantics) from natural language text for the purpose of language documentation and learning. Since I am interested in extracting these descriptions for all languages of the world, much of my research is also focussed on applying these to languages which are under-resourced. Specifically, I have explored techniques such as transfer-learning and active learning for building better models for under-resourced languages, and have applied them to natural language processing (NLP) applications such as named entity recognition (NER), part-of-speech (POS) and morphological analysis.

You can find my CV here and a list of my publications there. Please refer to my Google Scholar page for an updated list.

I have also developed an interface to explore and visualize the extracted language descriptions across many languages, click here to find your language! (If you don’t find your language do reach out and I will add it!)

You can reach me at aditichaud {at} google {dot} com.

News

Starting the ACL SIGEL Speaker Series: First talk by Marc Durdin on 18th July 2023, check talk details here.
New paper: Exploring the Viability of Synthetic Query Generation for Relevance Prediction accepted as a long paper at SIGIR eComm workshop 2023. [pdf]
Honored to the Secretary for ACL SIGEL!
Started a new job at Google Research!
Co-Organizing ComputEL-6 Workshop at at ICLDC 2023, held from March 5-6 virtually!
New paper: Salient Span Masking for Temporal Understanding accepted as a short paper ar EACL 2023. [pdf]
Co-Organizing ComputEL-5 Workshop at ACL 2022, held from May 26-27 in Dublin!
New paper: When is Wall a Pared and when a Muro?:Extracting Rules Governing Lexical Selection accepted as a long paper at EMNLP 2021. [pdf]
Interned at Google Research, India in Summer 2021: Worked on exploring temporal understanding in LLMs.
New paper: Reducing Confusion in Active Learning for Part-Of-Speech Tagging accepted as a long paper at TACL 2020. [pdf]
Grateful to the Waibel Presidential Fellowship for supporting my research.
New paper: Automatic Extraction of Rules Governing Morphological Agreement accepted as a long paper at EMNLP 2020. [pdf]
Interned at Google Research, MTV in Summer 2020: Worked on improving pre-trained multilingual models [pdf]