Sedrick Keh

I'm currently a research engineer at Toyota Research Institute working on pretraining and on multimodal models. I completed my MS at the Machine Learning Department at Carnegie Mellon University (CMU), where I did research on multimodal models and grounding with Daniel Fried, as well as on AI for social good Fei Fang. I previously received my Bachelor's degree in Data Science and Mathematics from the Hong Kong University of Science and Technology (HKUST).

Email  /  Google Scholar  /  Twitter  /  GitHub  /  CV

profile photo
News
Research

These days I'm very interested in pretraining, alignment, and evaluation. I'm specifically interested in multimodal models and applications to robotics. I have done work on grounding (NAACL Findings 2024), as well as on controllable text generation and evaluation (COLING 2022, EACL 2023).

Publications

2024

  • DataComp-LM: In search of the next generation of training sets for language models
    DCLM team
    pdf | abstract

  • SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages
    SEACrowd team
    pdf | abstract

  • Linearizing Large Language Models
    Jean Mercat*, Igor Vasiljevic*, Sedrick Keh*, Kushal Arora, Achal Dave, Adrien Gaidon, Thomas Kollar
    pdf | abstract

  • Language Models Scale Reliably with Over-training and on Downstream Tasks
    Samir Yitzhak Gadre, Georgios Smyrnis, Vaishaal Shankar, Suchin Gururangan, Mitchell Wortsman, Rulin Shao, Jean Mercat, Alex Fang, Jeffrey Li, Sedrick Keh, Rui Xin, Marianna Nezhurina, Igor Vasiljevic, Jenia Jitsev, Alexandros G Dimakis, Gabriel Ilharco, Shuran Song, Thomas Kollar, Yair Carmon, Achal Dave, Reinhard Heckel, Niklas Muennighoff, Ludwig Schmidt
    pdf | abstract

  • A Critical Evaluation of AI Feedback for Aligning Large Language Models
    Archit Sharma, Sedrick Keh, Eric Mitchell, Chelsea Finn, Kushal Arora, Thomas Kollar
    pdf | abstract

  • Where It Really Matters: Few-Shot Environmental Conservation Media Monitoring for Low-Resource Languages
    Sameer Jain, Sedrick Scott Keh, Shova Chhetri, Karun Dewan, Pablo Izquierdo, Johanna Prussmann, Pooja Shrestha, César Suárez, Zheyuan Ryan Shi, Lei Li, Fei Fang
    The 38th Annual AAAI Conference on Artificial Intelligence (AAAI 2024).
    pdf | abstract

2023

  • Asking More Informative Questions for Grounded Retrieval
    Sedrick Keh, Justin T Chiu, Daniel Fried
    Findings of the 2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL Findings 2024).
    pdf | abstract

  • Doolittle: Benchmarks and Corpora for Academic Writing Formalization
    Shizhe Diao, Yongyu Lei, Liangming Pan, Tianqing Fang, Wangchunshu Zhou, Sedrick Keh, Min-Yen Kan, Tong Zhang
    The 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023).
    pdf | abstract

  • Hashtag-Guided Low-Resource Tweet Classification
    Shizhe Diao*, Sedrick Scott Keh*, Liangming Pan, Zhiliang Tian, Yan Song, Tong Zhang
    Proceedings of the ACM Web Conference 2023 (WWW 2023).
    pdf | abstract

2022

  • PANCETTA: Phoneme Aware Neural Completion to Elicit Tongue Twisters Automatically
    Sedrick Scott Keh, Steven Y. Feng*, Varun Gangal*, Malihe Alikhani, Eduard Hovy
    The 17th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2023).
    pdf | abstract | github


  • EUREKA: EUphemism Recognition Enhanced through Knn-based methods and Augmentation
    Sedrick Scott Keh*, Rohit Bharadwaj*, Emmy Liu**, Simone Tedeschi**, Varun Gangal, Roberto Navigli
    The 3rd Workshop on Figurative Language Processing (EMNLP 2022 FigLang Workshop)
    pdf | abstract | github | slides


  • Exploring Euphemism Detection in Few-Shot and Zero-Shot Settings
    Sedrick Scott Keh
    The 3rd Workshop on Figurative Language Processing (EMNLP 2022 FigLang Workshop)
    pdf | abstract | github | poster


  • PINEAPPLE: Personifying INanimate Entities by Acquiring Parallel Personification Data for Learning Enhanced Generation
    Sedrick Scott Keh, Kevin Lu, Steven Y. Feng*, Varun Gangal*, Harsh Jhamtani, Malihe Alikhani, Eduard Hovy
    Proceedings of the 29th International Conference on Computational Linguistics (COLING 2022)
    pdf | abstract | github | talk | slides | poster


  • NewsPanda: Media Monitoring for Timely Conservation Action
    Sedrick Scott Keh*, Zheyuan Ryan Shi*, David J. Patterson, Nirmal Bhagabati, Karun Dewan, Areendran Gopala, Pablo Izquierdo, Debojyoti Mallick, Ambika Sharma, Pooja Shrestha, Fei Fang
    The 35th Annual Conference on Innovative Applications of Artificial Intelligence (IAAI 2023)
    pdf | abstract | github | website


2021 and older

  • Semi-supervised Noisy Student Pre-training on EfficientNet Architectures for Plant Pathology Classification
    Sedrick Scott Keh
    arXiv 2020
    pdf | abstract


  • Myers-Briggs Personality Classification and Personality-specific Language Generation Using Pre-trained Language Models
    Sedrick Scott Keh, I-Tsun Cheng
    arXiv 2019
    pdf | abstract




Site Template