publications

Publications in reversed chronological order. * indicates equal contribution

2025

  1. Causally Modeling the Linguistic and Social Factors that Predict Email Response
    Yinuo Xu*, Hong Chen*, Sushrita Rakshit*, Aparna Ananthasubramaniam*, Omkar Yadav*, Mingqian Zheng*, Michael Jiang*Lechen Zhang*, Bowen Yi*, Kenan Alkiek*, Abraham Israeli*, Bangzhao Shu*, Hua Shen*, Jiaxin Pei*, Haotian Zhang*, Miriam Schirmer*, and David Jurgens
    In Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), Apr 2025
  2. Under Review
    latent_geo_preview.jpg
    Latent Geographies: Joint Embeddings of Text and Visual Cues for Social Media Geolocation
    Lechen Zhang*, Abraham Israeli*, Rohan Raju, and David Jurgens
    May 2025
  3. Preprint
    Under Review
    verifact_preview.jpg
    VeriFact: Enhancing Long-Form Factuality Evaluation with Refined Fact Extraction and Reference Facts
    Xin Liu, Lechen Zhang, Sheza Munir, Yiyang Gu, and Lu Wang
    arXiv preprint arXiv:2505.09701, May 2025

2024

  1. NAACL 2024
    Oral
    llm_personas_preview.jpg
    You don’t need a personality test to know these models are unreliable: Assessing the Reliability of Large Language Models on Psychometric Instruments
    Bangzhao Shu*Lechen Zhang*, Minje Choi, Lavinia Dunagan, Lajanugen Logeswaran, Moontae Lee, Dallas Card, and David Jurgens
    In Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), Jun 2024
  2. FactBench: A Dynamic Benchmark for In-the-Wild Language Model Factuality Evaluation
    Farima Fatahi Bayat, Lechen Zhang, Sheza Munir, and Lu Wang
    arXiv preprint arXiv:2410.22257, Oct 2024
  3. Towards Global AI Inclusivity: A Large-Scale Multilingual Terminology Dataset (GIST)
    Jiarui Liu*, Iman Ouzzani*, Wenkai Li*Lechen Zhang, Tianyue Ou, Houda Bouamor, Zhijing Jin, and Mona Diab
    arXiv preprint arXiv:2412.18367, Dec 2024
  4. Preprint
    Under Review
    sprig_preview.jpg
    SPRIG: Improving Large Language Model Performance by System Prompt Optimization
    Lechen Zhang, Tolga Ergen, Lajanugen Logeswaran, Moontae Lee, and David Jurgens
    arXiv preprint arXiv:2410.14826, Oct 2024
  5. Preprint
    Under Review
    human_simulation_preview.jpg
    Real or Robotic? Assessing Whether LLMs Accurately Simulate Qualities of Human Responses in Dialogue
    Jonathan Ivey*, Shivani Kumar*, Jiayu Liu*, Hua Shen*, Sushrita Rakshit*, Rohan Raju*, Haotian Zhang*, Aparna Ananthasubramaniam*, Junghwan Kim*, Bowen Yi*, Dustin Wright*, Abraham Israeli*, Anders Giovanni Møller*Lechen Zhang*, and David Jurgens
    arXiv preprint arXiv:2409.08330, Sep 2024