Thennal D K [ˈθɛnnel]

I’m Thennal (any/all), a CS undergrad at the Indian Institute of Information Technology Kottayam, conducting natural language processing research since 2018. I’m interested in what makes language/speech models tick, and how to make them tick better. In particular:
- How do large pretrained models form their internal representations, and how does each component update it?
- There are a lot of pretrained and finetuned models available publicly. Can we use them to make better models?
- The field has a significant evaluation/benchmarking problem, particularly when it comes to non-English languages. How can we make it better?
I also like running, fungi, anything produced by Supergiant Games, and Japanese music. Go watch Etsuko Yakushimaru’s I’m Humanity, and then read about it.
news
Jan 22, 2025 | Our paper on ASR evaluation metrics was accepted to NAACL Findings 2025! |
---|---|
Oct 18, 2024 | Two new preprints, related to my internship with the University of Hamburg and my collaboration Jesin James from the University of Auckland. |
Feb 20, 2024 | Paper accepted at LREC-COLING 2024! Excited to go there in May and present our work, Fisher Mask Nodes for Language Model Merging. |
Feb 17, 2024 | Got the DAAD WISE scholarship for an internship with the University of Hamburg! |
latest posts
Oct 24, 2024 | The lost art of checking your sources |
---|---|
May 01, 2024 | Why do I chase the heat? |
Dec 29, 2022 | Whisper's evaluated metrics are kind of wrong for a bunch of languages |