Paper Presentations
Oral Presentations (+ 12-minute video; poster on request, space permitting)
If you are presenting an oral talk, please use this form to confirm attendance mode and optionally request an additional poster slot.
3D Rotation and Translation for Hyperbolic Knowledge Graph Embedding
A Comparative Analysis of Conversational Large Language Models in Knowledge-Based Text Generation
A Comparative Multidimensional Analysis of Empathetic Systems
A Comprehensive Survey of Sentence Representations: From the BERT Epoch to the CHATGPT Era and Beyond
A Prompt Response to the Demand for Automatic Gender-Neutral Translation
A Truly Joint Neural Architecture for Segmentation and Parsing
A Weak Supervision Approach for Few-Shot Aspect Based Sentiment Analysis
A* shortest string decoding for non-idempotent semirings
Accurate and Well-Calibrated ICD Code Assignment Through Attention Over Diverse Label Embeddings
Align and Augment: Generative Data Augmentation for Compositional Generalization
Aligning Large and Small Language Models via Chain-of-Thought Reasoning
Ameli: Enhancing Multimodal Entity Linking with Fine-Grained Attributes
An Empirical Analysis of Diversity in Argument Summarization
AnaDE1.0: A Novel Data Set for Benchmarking Analogy Detection and Extraction
Analyzing the Evaluation of Cross-Lingual Knowledge Transfer in Multilingual Language Models
Anchor Points: Benchmarking Models with Much Fewer Examples
AnthroScore: A Computational Linguistic Measure of Anthropomorphism
Archer: A Human-Labeled Text-to-SQL Dataset with Arithmetic, Commonsense and Hypothetical Reasoning
Asking the Right Question at the Right Time: Human and Model Uncertainty Guidance to Ask Clarification Questions
Automated Cognate Detection as a Supervised Link Prediction Task with Cognate Transformer
Bias in Opinion Summarisation from Pre-training to Adaptation: A Case Study in Political Bias
Can we obtain significant success in RST discourse parsing by using Large Language Models?
CATfOOD: Counterfactual Augmented Training for Improving Out-of-Domain Performance and Calibration
Centering the Speech Community
CEV-LM: Controlled Edit Vector Language Model for Shaping Natural Language Generations
Chaining Event Spans for Temporal Relation Grounding
Characterizing the Confidence of Large Language Model-Based Automatic Evaluation Metrics
Clever Hans or Neural Theory of Mind? Stress Testing Social Reasoning in Large Language Models
Code-Switched Language Identification is Harder Than You Think
Commonsense-augmented Memory Construction and Management in Long-term Conversations via Context-aware Persona Refinement
Contrastive Decoding Reduces Hallucinations in Large Multilingual Machine Translation Models
Counterfactual Reasoning with Knowledge Graph Embeddings
Cross-Lingual Transfer from Related Languages: Treating Low-Resource Maltese as Multilingual Code-Switching
Defending Against Disinformation Attacks in Open-Domain Question Answering
“Define Your Terms”: Enhancing Efficient Offensive Speech Classification with Definition
Diffusion-NAT: Self-Prompting Discrete Diffusion for Non-Autoregressive Text Generation
Disentangling the Roles of Target-side Transfer and Regularization in Multilingual Machine Translation
Do Moral Judgment and Reasoning Capability of LLMs Change with Language? A Study using the Multilingual Defining Issues Test
Document Structure in Long Document Transformers
Dynamic Masking Rate Schedules for MLM Pretraining
Enhancing Ethical Explanations of Large Language Models through Iterative Symbolic Refinement
Examining Gender and Racial Bias in Large Vision–Language Models Using a Novel Dataset of Parallel Images
Explaining Speech Classification Models via Word-Level Audio Segments and Paralinguistic Features
Exploring Data Augmentation in Neural DRS-to-Text Generation
Extreme Fine-tuning: A Novel and Fast Fine-tuning Approach for Text Classification
FAIR: Filtering of Automatically Induced Rules
Few-Shot Data Synthesis for Open Domain Multi-Hop Question Answering
Finding a Needle in the Adversarial Haystack: A Targeted Paraphrasing Approach For Uncovering Edge Cases with Minimal Distribution Distortion
Fine-Grained Natural Language Inference Based Faithfulness Evaluation for Diverse Summarisation Tasks
Fréchet Distance for Offline Evaluation of Information Retrieval Systems with Sparse Labels
Frequency Explains the Inverse Correlation of Large Language Models' Size, Training Data Amount, and Surprisal's Fit to Reading Times
From Partial to Strictly Incremental Constituent Parsing
From Text Segmentation to Smart Chaptering: A Novel Benchmark for Structuring Video Transcriptions
Generation and Polynomial Parsing of Graph Languages with Non-Structural Reentrancies
Generation, Distillation and Evaluation of Motivational Interviewing-Style Reflections with a Foundational Language Model
Generative Dense Retrieval: Memory Can Be a Burden
Geo-Encoder: A Chunk-Argument Bi-Encoder Framework for Chinese Geographic Re-Ranking
Gradient-Based Language Model Red Teaming
Graph Guided Question Answer Generation for Procedural Question-Answering
Graph-based Clustering for Detecting Semantic Change Across Time and Languages
HiGen: Hierarchy-Aware Sequence Generation for Hierarchical Text Classification
How Transferable are Attribute Controllers on Pretrained Multilingual Translation Models?
HumBEL: A Human-in-the-Loop Approach for Evaluating Demographic Factors of Language Models in Human-Machine Conversations
Identifying Narrative Content in Podcast Transcripts
Improving Acoustic Word Embeddings through Correspondence Training of Self-supervised Speech Representations
Improving Contrastive Learning in Emotion Recognition in Conversation via Data Augmentation and Decoupled Neutral Emotion
Improving Generalization in Semantic Parsing by Increasing Natural Language Variation
Improving the TENOR of Labeling: Re-evaluating Topic Models for Content Analysis
Injecting Wiktionary to improve token-level contextual representations using contrastive learning
It is not True that Transformers are Inductive Learners: Probing NLI Models with External Negation
It's All Relative: Learning Interpretable Models for Scoring Subjective Bias in Documents from Pairwise Comparisons
“It's how you do things that matters”: Attending to Process to Better Serve Indigenous Communities with Language Technologies
Kardeş-NLU: Transfer to Low-Resource Languages with Big Brother's Help – A Benchmark and Evaluation for Turkic Languages
LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions
Large-Scale Bitext Corpora Provide New Evidence for Cognitive Representations of Spatial Terms
Large-Scale Label Interpretation Learning for Few-Shot Named Entity Recognition
Leak, Cheat, Repeat: Data Contamination and Evaluation Malpractices in Closed-Source LLMs
LegalLens: Leveraging LLMs for Legal Violation Identification in Unstructured Text
Less is More for Long Document Summary Evaluation by LLMs
Leveraging ChatGPT in Pharmacovigilance Event Extraction: An Empirical Study
Leveraging fine-tuned Large Language Models with LoRA for Effective Claim, Claimer, and Claim Object Detection
Leveraging Implicit Feedback from Deployment Data in Dialogue
LOCOST: State-Space Models for Long Document Abstractive Summarization
Lost in Translationese? Reducing Translation Effect Using Abstract Meaning Representation
M4: Multi-generator, Multi-domain, and Multi-lingual Black-Box Machine-Generated Text Detection
MAFIA: Multi-Adapter Fused Inclusive Language Models
Meme-ingful Analysis: Enhanced Understanding of Cyberbullying in Memes Through Multimodal Explanations
Mitigating Hallucinations and Off-target Machine Translation with Source-Contrastive and Language-Contrastive Decoding
MLCopilot: Unleashing the Power of Large Language Models in Solving Machine Learning Tasks
Moderation in the Wild: Investigating User-Driven Moderation in Online Discussions
Multi-Level Attention Aggregation for Language-Agnostic Speaker Replication
Multilingual Gradient Word-Order Typology from Universal Dependencies
Multimodal Fallacy Classification in Political Debates
MultiMUC: Multilingual Template Filling on MUC-4
Neuralign: A Context-Aware, Cross-Lingual and Fully-Neural Sentence Alignment System for Long Texts
NevIR: Negation in Neural Information Retrieval
NNOSE: Nearest Neighbor Occupational Skill Extraction
No Error Left Behind: Multilingual Grammatical Error Correction with Pre-trained Translation Models
On the Benefits of Fine-Grained Loss Truncation: A Case Study on Factuality in Summarization
OpenPI2.0: An Improved Dataset for Entity Tracking in Texts
Over-Reasoning and Redundant Calculation of Large Language Models
Plan-Grounded Large Language Models for Dual Goal Conversational Settings
Pre-Training Methods for Question Reranking
Predict the Next Word: Humans exhibit uncertainty in this task and language models
Predicting Client Emotions and Therapist Interventions in Psychotherapy Dialogues
Presentations by the Humans and For the Humans: Harnessing LLMs for Generating Persona-Aware Slides from Documents
Putting Context in Context: the Impact of Discussion Structure on Text Classification
Quality Does Matter: A Detailed Look at the Quality and Utility of Web-Mined Parallel Corpora
Quantifying the Hyperparameter Sensitivity of Neural Networks for Character-level Sequence-to-Sequence Tasks
REFINER: Reasoning Feedback on Intermediate Representations
Rethinking Loss Functions for Fact Verification
Robust Neural Machine Translation for Abugidas by Glyph Perturbation
Scaling up Discovery of Latent Concepts in Deep NLP Models
Semantic Sensitivities and Inconsistent Predictions: Measuring the Fragility of NLI Models
Sensitivity, Performance, Robustness: Deconstructing the Effect of Sociodemographic Prompting
Sentence Representations via Gaussian Embedding
SentenceLDA: Discriminative and Robust Document Representation with Sentence Level Topic Model
SIB-200: A Simple, Inclusive, and Big Evaluation Dataset for Topic Classification in 200+ Languages and Dialects
Small Language Models Improve Giants by Rewriting Their Outputs
SOCIALITE-LLAMA: An Instruction-Tuned Model for Social Scientific Tasks
STORiCo: Storytelling TTS for Hindi with Character Voice Modulation
Syntactic Preposing and Discourse Relations
SynthDST: Synthetic Data is All You Need for Few-Shot Dialog State Tracking
TESS: Text-to-Text Self-Conditioned Simplex Diffusion
Text-Guided Image Clustering
Text-to-Code Generation with Modality-relative Pre-training
Text-to-OverpassQL: A Natural Language Interface for Complex Geodata Querying of OpenStreetMap
The Parrot Dilemma: Human-Labeled vs. LLM-augmented Data in Classification Tasks
The Role of Data Curation in Image Captioning
Think Twice: Measuring the Efficiency of Eliminating Prediction Shortcuts of Question Answering Models
ToPro: Token-Level Prompt Decomposition for Cross-Lingual Sequence Labeling Tasks
Towards Hierarchical Spoken Language Disfluency Modeling
Tracing the Roots of Facts in Multilingual Language Models: Independent, Shared, and Transferred Knowledge
Translate to Disambiguate: Zero-shot Multilingual Word Sense Disambiguation with Pretrained Language Models
Translation Errors Significantly Impact Low-Resource Languages in Cross-Lingual Learning
Uncovering Stereotypes in Large Language Models: A Task Complexity-based Approach
Unintended Bias Detection and Mitigation in Misogynous Memes
Unleashing the Power of Discourse-Enhanced Transformers for Propaganda Detection
UNSEE: Unsupervised Non-contrastive Sentence Embeddings
Unsupervised Contrast-Consistent Ranking with Language Models
Unsupervised stance detection for social media discussions: A generic baseline
VEIL: Vetting Extracted Image Labels from In-the-Wild Captions for Weakly-Supervised Object Detection
What Makes Medical Claims (Un)Verifiable? Analyzing Entity and Relation Properties for Fact Verification
WSC+: Enhancing The Winograd Schema Challenge Using Tree-of-Experts
Poster Presentations (+ 12-minute video recording)
A Classification-Guided Approach for Adversarial Attacks against Neural Machine Translation
A Dataset for Metaphor Detection in Early Medieval Hebrew Poetry
A Multimodal Framework to Detect Target Aware Aggression in Memes
A RelEntLess Benchmark for Modelling Graded Relations between Named Entities
“According to ...”: Prompting Language Models Improves Quoting from Pre-Training Data
Advancing Precise Outline-Conditioned Text Generation with Task Duality and Explicit Outline Control
Anisotropy Is Inherent to Self-Attention in Transformers
Answering legal questions from laymen in German civil law system
Approximate Attributions for Off-the-Shelf Siamese Transformers
Are Character-level Translations Worth the Wait? Comparing Pretrained Character- and Subword-level Models for Machine Translation
Argument Mining as a Text-to-Text Generation Task
Ask, Assess, and Refine: Rectifying Factual Consistency and Hallucination in LLMs with Metric-Guided Feedback Learning
Backward Compatibility During Data Updates by Weight Interpolation
CCPrefix: Counterfactual Contrastive Prefix-Tuning for Many-Class Classification
CEAN: Contrastive Event Aggregation Network with LLM-based Augmentation for Event Extraction
CharSpan: Utilizing Lexical Similarity to Enable Zero-Shot Machine Translation for Extremely Low-resource Languages
Comparing Knowledge Sources for Open-Domain Scientific Claim Verification
Comparing Template-based and Template-free Language Model Probing
ConstraintChecker: A Plugin for Large Language Models to Reason on Commonsense Knowledge Bases
Corpus-Steered Query Expansion with Large Language Models
Creating Suspenseful Stories: Iterative Planning with Large Language Models
Describing Images Fast and Slow: Quantifying and Predicting the Variation in Human Signals during Visuo-Linguistic Processes
Desiderata For The Context Use Of Question Answering Systems
Discovering and Articulating Frames of Communication from Social Media Using Chain-of-Thought Reasoning
Do Text Simplification Systems Convey Correct Information? A Human Evaluation via Reading Comprehension
Effective Controllable Bias Mitigation for Classification and Retrieval using Gate Adapters
EnCore: Fine-Grained Entity Typing by Pre-Training Entity Encoders on Coreference Chains
Entity-level Factual Adaptiveness of Fine-tuning based Abstractive Summarization Models
Equipping Language Models with Tool Use Capability for Tabular Data Analysis in Finance
Evaluating the Factuality of Zero-shot Summarizers Across Varied Domains
Evaluating Unsupervised Argument Aligners via Generation of Conclusions of Structured Scientific Abstracts
EXPLORER: Exploration-guided Reasoning for Textual Reinforcement Learning
Exploring the Robustness of Task-oriented Dialogue Systems for Colloquial German Varieties
Few-Shot Dialogue Summarization via Skeleton-Assisted Prompt Transfer in Prompt Tuning
FinBPM: A Framework for Portfolio Management-based Financial Investor Behavior Perception Model
Flow Matching for Conditional Text Generation in a Few Sampling Steps
French GossipPrompts: Dataset For Prevention of Generating French Gossip Stories By LLMs
GAINER: Graph Machine Learning with Node-specific Radius for Classification of Short Texts and Documents
GEAR: Augmenting Language Models with Generalizable and Efficient Tool Resolution
Generating Benchmarks for Factuality Evaluation of Language Models
Generation-driven Contrastive Self-training for Zero-shot Text Classification with Instruction-following LLM
GUMsley: Evaluating Entity Salience in Summarization for 12 English Genres
Human Temporal Inferences Go Beyond Aspectual Class
Importance-Aware Data Augmentation for Document-Level Neural Machine Translation
Interpreting Predictive Probabilities: Model Confidence or Human Label Variation?
Investigating Agency of LLMs in Human-AI Collaboration Tasks
Investigating Content Planning for Navigating Trade-offs in Knowledge-Grounded Dialogue
Investigating the Potential of Task Arithmetic for Cross-Lingual Transfer
Language Model Sentence Completion with a Parser-Driven Rhetorical Control Method
Language Models as Inductive Reasoners
LAraBench: Benchmarking Arabic AI with Large Language Models
Learning to Retrieve In-Context Examples for Large Language Models
Leveraging Multi-lingual Positive Instances in Contrastive Learning to Improve Sentence Embedding
Like a Good Nearest Neighbor: Practical Content Moderation and Text Classification
LLM Comparative Assessment: Zero-shot NLG Evaluation through Pairwise Comparisons using Large Language Models
Localization vs. Semantics: Visual Representations in Unimodal and Multimodal Models
Measuring Uncertainty in Neural Machine Translation with Similarity-Sensitive Entropy
More Discriminative Sentence Embeddings via Semantic Graph Smoothing
μPLAN: Summarizing using a Content Plan as Cross-Lingual Bridge
Multi-Reference Benchmarks for Russian Grammatical Error Correction
Multi-Relational Hyperbolic Word Embeddings from Natural Language Definitions
Parameter-Efficient Conversational Recommender System as a Language Processing Task
PEARL: Prompting Large Language Models to Plan and Execute Actions Over Long Documents
Polarized Opinion Detection Improves the Detection of Toxic Language
Quantifying Stereotypes in Language
Rainbow - A Benchmark for Systematic Testing of How Sensitive Visio-Linguistic Models are to Color Naming
SCO-VIST: Social Interaction Commonsense Knowledge-based Visual Storytelling
Should I try multiple optimizers when fine-tuning a pre-trained Transformer for NLP tasks? Should I tune their hyperparameters?
Smaller Language Models are Better Zero-shot Machine-Generated Text Detectors
Source Identification in Abstractive Summarization
SPUQ: Perturbation-Based Uncertainty Quantification for Large Language Models
STable: Table Generation Framework for Encoder-Decoder Models
Style-News: Incorporating Stylized News Generation and Adversarial Verification for Neural Fake News Detection
System-Level Natural Language Feedback
Threat Behavior Textual Search by Attention Graph Isomorphism
UP5: Unbiased Foundation Model for Fairness-aware Recommendation
ViLexNorm: A Lexical Normalization Corpus for Vietnamese Social Media Text
VlogQA: Task, Dataset, and Baseline Models for Vietnamese Spoken-Based Machine Reading Comprehension
VOLTAGE: A Versatile Contrastive Learning based OCR Methodology for ultra low-resource scripts through Auto Glyph Feature Extraction
Where Do We Go From Here? Multi-scale Allocentric Relational Inference from Natural Spatial Descriptions
Who Needs Decoders? Efficient Estimation of Sequence-Level Attributes with Proxies
Zero-Shot End-to-End Spoken Language Understanding via Cross-Modal Selective Self-Training
Zero-shot Sentiment Analysis in Low-Resource Languages Using a Multilingual Sentiment Lexicon