2024
-
MCECR: A Novel Dataset for Multilingual Cross-Document Event Coreference Resolution
Amir Pouran Ben Veyseh, Viet Dac Lai, Chien Van Nguyen, Franck Dernoncourt, Thien Huu Nguyen | NAACL (Findings) | pdf [To appear]
-
CulturaX: A Cleaned, Enormous, and Multilingual Dataset for Large Language Models in 167 Languages
Thuat Nguyen, Chien Van Nguyen, Viet Dac Lai, Hieu Man, Nghia Trung Ngo, Franck Dernoncourt, Ryan A. Rossi, Thien Huu Nguyen | LREC-COLING | pdf
-
CAMAL: A Novel Dataset for Multi-label Conversational Argument Move Analysis
Viet Dac Lai, Duy Pham, Jonathan Steinberg, Jamie Mikeska, Thien Huu Nguyen | LREC-COLING | pdf [To appear]
-
DocFinQA: A Long-Context Financial Reasoning Dataset
Varshini Reddy, Rik Koncel-Kedziorski, Viet Dac Lai, Chris Tanner | Preprint | pdf
-
BizBench: A Quantitative Reasoning Benchmark for Business and Finance
Rik Koncel-Kedziorski, Michael Krumdick, Viet Dac Lai, Varshini Reddy, Charles Lovering, Chris Tanner | Preprint | pdf
-
Using Machine Learning to Detect Student Learning Levels along a Learning Progression
Duy Pham, Viet Dac Lai | NCME 2024 | pdf