← Home

Publications

* denotes equal contribution

Mind the Gap! Pathways Towards Unifying AI Safety and Ethics Research

Mind the Gap! Pathways Towards Unifying AI Safety and Ethics Research

Dani Roytburg* and Beck Miller*

Proceedings of the International Association for Safe and Ethical AI, 2026

Words and Action: Modeling Linguistic Leadership in # BlackLivesMatter Communities

Words and Action: Modeling Linguistic Leadership in # BlackLivesMatter Communities

Dani Roytburg*, Deborah Olorunisola*, Sandeep Soni, and Lauren Klein

Proceedings of the International AAAI Conference on Web and Social Media, 2025

Breaking the Mirror: Examining Self-Preference in LLM Evaluators through Activation-Based Representations

Breaking the Mirror: Examining Self-Preference in LLM Evaluators through Activation-Based Representations

Dani Roytburg*, Matthew Bozoukov*, Hongyu Fu, Matthew Nguyen*, Jou Barzdukas*, and Narmeen Fatimah Oozeer

Mechanistic Interpretability Workshop at NeurIPS 2025, 2025

Also at: NeurIPS 2025 Workshop on Evaluating the Evolving LLM Lifecycle; NeurIPS 2025 Workshop on Reliable ML from Unreliable Data

Generative Argument Mining: Pretrained Language Models are Argumentative Text Parsers

Daniel Roytburg

Undergraduate Thesis, Emory University, 2025