Sedrick Keh

I'm currently a research engineer at Toyota Research Institute working on pre-training, post-training, and multimodal models. I completed my MS at the Machine Learning Department at Carnegie Mellon University (CMU), where I did research on multimodal models and grounding with Daniel Fried, as well as on AI for social good Fei Fang. I previously received my Bachelor's degree in Mathematics and Data Science from the Hong Kong University of Science and Technology (HKUST).

Email  /  Google Scholar  /  Twitter  /  GitHub  /  CV

profile photo
News
Research

These days I'm very interested in pretraining, post-training, and evaluation. I'm specifically interested in multimodal models and applications to robotics and embodied systems. I have previously done work on grounding (NAACL Findings 2024), as well as on controllable text generation and evaluation (COLING 2022, EACL 2023).

Publications

2025

  • Should VLMs be Pre-trained with Image Data?
    Sedrick Keh*, Jean Mercat*, Samir Yitzhak Gadre, Kushal Arora, Igor Vasiljevic, Benjamin Burchfiel, Shuran Song, Russ Tedrake, Thomas Kollar, Ludwig Schmidt, Achal Dave
    ICLR 2025
    pdf | abstract

2024

  • DataComp-LM: In search of the next generation of training sets for language models
    DCLM team
    NeurIPS Datasets and Benchmarks 2024
    pdf | abstract

  • SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages
    SEACrowd team
    EMNLP 2024
    pdf | abstract

  • Linearizing Large Language Models
    Jean Mercat*, Igor Vasiljevic*, Sedrick Keh*, Kushal Arora, Achal Dave, Adrien Gaidon, Thomas Kollar
    COLM 2024
    pdf | abstract

  • Language Models Scale Reliably with Over-training and on Downstream Tasks
    Samir Yitzhak Gadre, Georgios Smyrnis, Vaishaal Shankar, Suchin Gururangan, Mitchell Wortsman, Rulin Shao, Jean Mercat, Alex Fang, Jeffrey Li, Sedrick Keh, Rui Xin, Marianna Nezhurina, Igor Vasiljevic, Jenia Jitsev, Alexandros G Dimakis, Gabriel Ilharco, Shuran Song, Thomas Kollar, Yair Carmon, Achal Dave, Reinhard Heckel, Niklas Muennighoff, Ludwig Schmidt
    ICLR 2025
    pdf | abstract

  • A Critical Evaluation of AI Feedback for Aligning Large Language Models
    Archit Sharma, Sedrick Keh, Eric Mitchell, Chelsea Finn, Kushal Arora, Thomas Kollar
    NeurIPS 2024
    pdf | abstract

  • Where It Really Matters: Few-Shot Environmental Conservation Media Monitoring for Low-Resource Languages
    Sameer Jain, Sedrick Scott Keh, Shova Chhetri, Karun Dewan, Pablo Izquierdo, Johanna Prussmann, Pooja Shrestha, César Suárez, Zheyuan Ryan Shi, Lei Li, Fei Fang
    AAAI 2024
    pdf | abstract

2023

  • Asking More Informative Questions for Grounded Retrieval
    Sedrick Keh, Justin T Chiu, Daniel Fried
    NAACL Findings 2024
    pdf | abstract

  • Doolittle: Benchmarks and Corpora for Academic Writing Formalization
    Shizhe Diao, Yongyu Lei, Liangming Pan, Tianqing Fang, Wangchunshu Zhou, Sedrick Keh, Min-Yen Kan, Tong Zhang
    EMNLP 2023
    pdf | abstract

  • Hashtag-Guided Low-Resource Tweet Classification
    Shizhe Diao*, Sedrick Scott Keh*, Liangming Pan, Zhiliang Tian, Yan Song, Tong Zhang
    WWW 2023
    pdf | abstract

2022

  • PANCETTA: Phoneme Aware Neural Completion to Elicit Tongue Twisters Automatically
    Sedrick Scott Keh, Steven Y. Feng*, Varun Gangal*, Malihe Alikhani, Eduard Hovy
    EACL 2023
    pdf | abstract | github


  • EUREKA: EUphemism Recognition Enhanced through Knn-based methods and Augmentation
    Sedrick Scott Keh*, Rohit Bharadwaj*, Emmy Liu**, Simone Tedeschi**, Varun Gangal, Roberto Navigli
    EMNLP 2022 FigLang Workshop
    pdf | abstract | github | slides


  • Exploring Euphemism Detection in Few-Shot and Zero-Shot Settings
    Sedrick Scott Keh
    EMNLP 2022 FigLang Workshop
    pdf | abstract | github | poster


  • PINEAPPLE: Personifying INanimate Entities by Acquiring Parallel Personification Data for Learning Enhanced Generation
    Sedrick Scott Keh, Kevin Lu, Steven Y. Feng*, Varun Gangal*, Harsh Jhamtani, Malihe Alikhani, Eduard Hovy
    COLING 2022
    pdf | abstract | github | talk | slides | poster


  • NewsPanda: Media Monitoring for Timely Conservation Action
    Sedrick Scott Keh*, Zheyuan Ryan Shi*, David J. Patterson, Nirmal Bhagabati, Karun Dewan, Areendran Gopala, Pablo Izquierdo, Debojyoti Mallick, Ambika Sharma, Pooja Shrestha, Fei Fang
    IAAI 2023
    pdf | abstract | github | website


2021 and older

  • Semi-supervised Noisy Student Pre-training on EfficientNet Architectures for Plant Pathology Classification
    Sedrick Scott Keh
    arXiv 2020
    pdf | abstract


  • Myers-Briggs Personality Classification and Personality-specific Language Generation Using Pre-trained Language Models
    Sedrick Scott Keh, I-Tsun Cheng
    arXiv 2019
    pdf | abstract




Site Template