Paper Presentations

Oral Presentations (+ 12-minute video; poster on request, space permitting)

If you are presenting an oral talk, please use this form to confirm attendance mode and optionally request an additional poster slot.

3D Rotation and Translation for Hyperbolic Knowledge Graph Embedding

A Comparative Analysis of Conversational Large Language Models in Knowledge-Based Text Generation

A Comparative Multidimensional Analysis of Empathetic Systems

A Comprehensive Survey of Sentence Representations: From the BERT Epoch to the CHATGPT Era and Beyond

A Prompt Response to the Demand for Automatic Gender-Neutral Translation

A Truly Joint Neural Architecture for Segmentation and Parsing

A Weak Supervision Approach for Few-Shot Aspect Based Sentiment Analysis

A* shortest string decoding for non-idempotent semirings

Accurate and Well-Calibrated ICD Code Assignment Through Attention Over Diverse Label Embeddings

Align and Augment: Generative Data Augmentation for Compositional Generalization

Aligning Large and Small Language Models via Chain-of-Thought Reasoning

Ameli: Enhancing Multimodal Entity Linking with Fine-Grained Attributes

An Empirical Analysis of Diversity in Argument Summarization

AnaDE1.0: A Novel Data Set for Benchmarking Analogy Detection and Extraction

Analyzing the Evaluation of Cross-Lingual Knowledge Transfer in Multilingual Language Models

Anchor Points: Benchmarking Models with Much Fewer Examples

AnthroScore: A Computational Linguistic Measure of Anthropomorphism

Archer: A Human-Labeled Text-to-SQL Dataset with Arithmetic, Commonsense and Hypothetical Reasoning

Asking the Right Question at the Right Time: Human and Model Uncertainty Guidance to Ask Clarification Questions

Automated Cognate Detection as a Supervised Link Prediction Task with Cognate Transformer

Bias in Opinion Summarisation from Pre-training to Adaptation: A Case Study in Political Bias

Can we obtain significant success in RST discourse parsing by using Large Language Models?

CATfOOD: Counterfactual Augmented Training for Improving Out-of-Domain Performance and Calibration

Centering the Speech Community

CEV-LM: Controlled Edit Vector Language Model for Shaping Natural Language Generations

Chaining Event Spans for Temporal Relation Grounding

Characterizing the Confidence of Large Language Model-Based Automatic Evaluation Metrics

Clever Hans or Neural Theory of Mind? Stress Testing Social Reasoning in Large Language Models

Code-Switched Language Identification is Harder Than You Think

Commonsense-augmented Memory Construction and Management in Long-term Conversations via Context-aware Persona Refinement

Contrastive Decoding Reduces Hallucinations in Large Multilingual Machine Translation Models

Counterfactual Reasoning with Knowledge Graph Embeddings

Cross-Lingual Transfer from Related Languages: Treating Low-Resource Maltese as Multilingual Code-Switching

Defending Against Disinformation Attacks in Open-Domain Question Answering

“Define Your Terms”: Enhancing Efficient Offensive Speech Classification with Definition

Diffusion-NAT: Self-Prompting Discrete Diffusion for Non-Autoregressive Text Generation

Disentangling the Roles of Target-side Transfer and Regularization in Multilingual Machine Translation

Do Moral Judgment and Reasoning Capability of LLMs Change with Language? A Study using the Multilingual Defining Issues Test

Document Structure in Long Document Transformers

Dynamic Masking Rate Schedules for MLM Pretraining

Enhancing Ethical Explanations of Large Language Models through Iterative Symbolic Refinement

Examining Gender and Racial Bias in Large Vision–Language Models Using a Novel Dataset of Parallel Images

Explaining Speech Classification Models via Word-Level Audio Segments and Paralinguistic Features

Exploring Data Augmentation in Neural DRS-to-Text Generation

Extreme Fine-tuning: A Novel and Fast Fine-tuning Approach for Text Classification

FAIR: Filtering of Automatically Induced Rules

Few-Shot Data Synthesis for Open Domain Multi-Hop Question Answering

Finding a Needle in the Adversarial Haystack: A Targeted Paraphrasing Approach For Uncovering Edge Cases with Minimal Distribution Distortion

Fine-Grained Natural Language Inference Based Faithfulness Evaluation for Diverse Summarisation Tasks

Fréchet Distance for Offline Evaluation of Information Retrieval Systems with Sparse Labels

Frequency Explains the Inverse Correlation of Large Language Models' Size, Training Data Amount, and Surprisal's Fit to Reading Times

From Partial to Strictly Incremental Constituent Parsing

From Text Segmentation to Smart Chaptering: A Novel Benchmark for Structuring Video Transcriptions

Generation and Polynomial Parsing of Graph Languages with Non-Structural Reentrancies

Generation, Distillation and Evaluation of Motivational Interviewing-Style Reflections with a Foundational Language Model

Generative Dense Retrieval: Memory Can Be a Burden

Geo-Encoder: A Chunk-Argument Bi-Encoder Framework for Chinese Geographic Re-Ranking

Gradient-Based Language Model Red Teaming

Graph Guided Question Answer Generation for Procedural Question-Answering

Graph-based Clustering for Detecting Semantic Change Across Time and Languages

HiGen: Hierarchy-Aware Sequence Generation for Hierarchical Text Classification

How Transferable are Attribute Controllers on Pretrained Multilingual Translation Models?

HumBEL: A Human-in-the-Loop Approach for Evaluating Demographic Factors of Language Models in Human-Machine Conversations

Identifying Narrative Content in Podcast Transcripts

Improving Acoustic Word Embeddings through Correspondence Training of Self-supervised Speech Representations

Improving Contrastive Learning in Emotion Recognition in Conversation via Data Augmentation and Decoupled Neutral Emotion

Improving Generalization in Semantic Parsing by Increasing Natural Language Variation

Improving the TENOR of Labeling: Re-evaluating Topic Models for Content Analysis

Injecting Wiktionary to improve token-level contextual representations using contrastive learning

It is not True that Transformers are Inductive Learners: Probing NLI Models with External Negation

It's All Relative: Learning Interpretable Models for Scoring Subjective Bias in Documents from Pairwise Comparisons

“It's how you do things that matters”: Attending to Process to Better Serve Indigenous Communities with Language Technologies

Kardeş-NLU: Transfer to Low-Resource Languages with Big Brother's Help – A Benchmark and Evaluation for Turkic Languages

LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions

Large-Scale Bitext Corpora Provide New Evidence for Cognitive Representations of Spatial Terms

Large-Scale Label Interpretation Learning for Few-Shot Named Entity Recognition

Leak, Cheat, Repeat: Data Contamination and Evaluation Malpractices in Closed-Source LLMs

LegalLens: Leveraging LLMs for Legal Violation Identification in Unstructured Text

Less is More for Long Document Summary Evaluation by LLMs

Leveraging ChatGPT in Pharmacovigilance Event Extraction: An Empirical Study

Leveraging fine-tuned Large Language Models with LoRA for Effective Claim, Claimer, and Claim Object Detection

Leveraging Implicit Feedback from Deployment Data in Dialogue

LOCOST: State-Space Models for Long Document Abstractive Summarization

Lost in Translationese? Reducing Translation Effect Using Abstract Meaning Representation

M4: Multi-generator, Multi-domain, and Multi-lingual Black-Box Machine-Generated Text Detection

MAFIA: Multi-Adapter Fused Inclusive Language Models

Meme-ingful Analysis: Enhanced Understanding of Cyberbullying in Memes Through Multimodal Explanations

Mitigating Hallucinations and Off-target Machine Translation with Source-Contrastive and Language-Contrastive Decoding

MLCopilot: Unleashing the Power of Large Language Models in Solving Machine Learning Tasks

Moderation in the Wild: Investigating User-Driven Moderation in Online Discussions

Multi-Level Attention Aggregation for Language-Agnostic Speaker Replication

Multilingual Gradient Word-Order Typology from Universal Dependencies

Multimodal Fallacy Classification in Political Debates

MultiMUC: Multilingual Template Filling on MUC-4

Neuralign: A Context-Aware, Cross-Lingual and Fully-Neural Sentence Alignment System for Long Texts

NevIR: Negation in Neural Information Retrieval

NNOSE: Nearest Neighbor Occupational Skill Extraction

No Error Left Behind: Multilingual Grammatical Error Correction with Pre-trained Translation Models

On the Benefits of Fine-Grained Loss Truncation: A Case Study on Factuality in Summarization

OpenPI2.0: An Improved Dataset for Entity Tracking in Texts

Over-Reasoning and Redundant Calculation of Large Language Models

Plan-Grounded Large Language Models for Dual Goal Conversational Settings

Pre-Training Methods for Question Reranking

Predict the Next Word: Humans exhibit uncertainty in this task and language models

Predicting Client Emotions and Therapist Interventions in Psychotherapy Dialogues

Presentations by the Humans and For the Humans: Harnessing LLMs for Generating Persona-Aware Slides from Documents

Putting Context in Context: the Impact of Discussion Structure on Text Classification

Quality Does Matter: A Detailed Look at the Quality and Utility of Web-Mined Parallel Corpora

Quantifying the Hyperparameter Sensitivity of Neural Networks for Character-level Sequence-to-Sequence Tasks

REFINER: Reasoning Feedback on Intermediate Representations

Rethinking Loss Functions for Fact Verification

Robust Neural Machine Translation for Abugidas by Glyph Perturbation

Scaling up Discovery of Latent Concepts in Deep NLP Models

Semantic Sensitivities and Inconsistent Predictions: Measuring the Fragility of NLI Models

Sensitivity, Performance, Robustness: Deconstructing the Effect of Sociodemographic Prompting

Sentence Representations via Gaussian Embedding

SentenceLDA: Discriminative and Robust Document Representation with Sentence Level Topic Model

SIB-200: A Simple, Inclusive, and Big Evaluation Dataset for Topic Classification in 200+ Languages and Dialects

Small Language Models Improve Giants by Rewriting Their Outputs

SOCIALITE-LLAMA: An Instruction-Tuned Model for Social Scientific Tasks

STORiCo: Storytelling TTS for Hindi with Character Voice Modulation

Syntactic Preposing and Discourse Relations

SynthDST: Synthetic Data is All You Need for Few-Shot Dialog State Tracking

TESS: Text-to-Text Self-Conditioned Simplex Diffusion

Text-Guided Image Clustering

Text-to-Code Generation with Modality-relative Pre-training

Text-to-OverpassQL: A Natural Language Interface for Complex Geodata Querying of OpenStreetMap

The Parrot Dilemma: Human-Labeled vs. LLM-augmented Data in Classification Tasks

The Role of Data Curation in Image Captioning

Think Twice: Measuring the Efficiency of Eliminating Prediction Shortcuts of Question Answering Models

ToPro: Token-Level Prompt Decomposition for Cross-Lingual Sequence Labeling Tasks

Towards Hierarchical Spoken Language Disfluency Modeling

Tracing the Roots of Facts in Multilingual Language Models: Independent, Shared, and Transferred Knowledge

Translate to Disambiguate: Zero-shot Multilingual Word Sense Disambiguation with Pretrained Language Models

Translation Errors Significantly Impact Low-Resource Languages in Cross-Lingual Learning

Uncovering Stereotypes in Large Language Models: A Task Complexity-based Approach

Unintended Bias Detection and Mitigation in Misogynous Memes

Unleashing the Power of Discourse-Enhanced Transformers for Propaganda Detection

UNSEE: Unsupervised Non-contrastive Sentence Embeddings

Unsupervised Contrast-Consistent Ranking with Language Models

Unsupervised stance detection for social media discussions: A generic baseline

VEIL: Vetting Extracted Image Labels from In-the-Wild Captions for Weakly-Supervised Object Detection

What Makes Medical Claims (Un)Verifiable? Analyzing Entity and Relation Properties for Fact Verification

WSC+: Enhancing The Winograd Schema Challenge Using Tree-of-Experts

Poster Presentations (+ 12-minute video recording)

A Classification-Guided Approach for Adversarial Attacks against Neural Machine Translation

A Dataset for Metaphor Detection in Early Medieval Hebrew Poetry

A Multimodal Framework to Detect Target Aware Aggression in Memes

A RelEntLess Benchmark for Modelling Graded Relations between Named Entities

“According to ...”: Prompting Language Models Improves Quoting from Pre-Training Data

Advancing Precise Outline-Conditioned Text Generation with Task Duality and Explicit Outline Control

Anisotropy Is Inherent to Self-Attention in Transformers

Answering legal questions from laymen in German civil law system

Approximate Attributions for Off-the-Shelf Siamese Transformers

Are Character-level Translations Worth the Wait? Comparing Pretrained Character- and Subword-level Models for Machine Translation

Argument Mining as a Text-to-Text Generation Task

Ask, Assess, and Refine: Rectifying Factual Consistency and Hallucination in LLMs with Metric-Guided Feedback Learning

Backward Compatibility During Data Updates by Weight Interpolation

CCPrefix: Counterfactual Contrastive Prefix-Tuning for Many-Class Classification

CEAN: Contrastive Event Aggregation Network with LLM-based Augmentation for Event Extraction

CharSpan: Utilizing Lexical Similarity to Enable Zero-Shot Machine Translation for Extremely Low-resource Languages

Comparing Knowledge Sources for Open-Domain Scientific Claim Verification

Comparing Template-based and Template-free Language Model Probing

ConstraintChecker: A Plugin for Large Language Models to Reason on Commonsense Knowledge Bases

Corpus-Steered Query Expansion with Large Language Models

Creating Suspenseful Stories: Iterative Planning with Large Language Models

Describing Images Fast and Slow: Quantifying and Predicting the Variation in Human Signals during Visuo-Linguistic Processes

Desiderata For The Context Use Of Question Answering Systems

Discovering and Articulating Frames of Communication from Social Media Using Chain-of-Thought Reasoning

Do Text Simplification Systems Convey Correct Information? A Human Evaluation via Reading Comprehension

Effective Controllable Bias Mitigation for Classification and Retrieval using Gate Adapters

EnCore: Fine-Grained Entity Typing by Pre-Training Entity Encoders on Coreference Chains

Entity-level Factual Adaptiveness of Fine-tuning based Abstractive Summarization Models

Equipping Language Models with Tool Use Capability for Tabular Data Analysis in Finance

Evaluating the Factuality of Zero-shot Summarizers Across Varied Domains

Evaluating Unsupervised Argument Aligners via Generation of Conclusions of Structured Scientific Abstracts

EXPLORER: Exploration-guided Reasoning for Textual Reinforcement Learning

Exploring the Robustness of Task-oriented Dialogue Systems for Colloquial German Varieties

Few-Shot Dialogue Summarization via Skeleton-Assisted Prompt Transfer in Prompt Tuning

FinBPM: A Framework for Portfolio Management-based Financial Investor Behavior Perception Model

Flow Matching for Conditional Text Generation in a Few Sampling Steps

French GossipPrompts: Dataset For Prevention of Generating French Gossip Stories By LLMs

GAINER: Graph Machine Learning with Node-specific Radius for Classification of Short Texts and Documents

GEAR: Augmenting Language Models with Generalizable and Efficient Tool Resolution

Generating Benchmarks for Factuality Evaluation of Language Models

Generation-driven Contrastive Self-training for Zero-shot Text Classification with Instruction-following LLM

GUMsley: Evaluating Entity Salience in Summarization for 12 English Genres

Human Temporal Inferences Go Beyond Aspectual Class

Importance-Aware Data Augmentation for Document-Level Neural Machine Translation

Interpreting Predictive Probabilities: Model Confidence or Human Label Variation?

Investigating Agency of LLMs in Human-AI Collaboration Tasks

Investigating Content Planning for Navigating Trade-offs in Knowledge-Grounded Dialogue

Investigating the Potential of Task Arithmetic for Cross-Lingual Transfer

Language Model Sentence Completion with a Parser-Driven Rhetorical Control Method

Language Models as Inductive Reasoners

LAraBench: Benchmarking Arabic AI with Large Language Models

Learning to Retrieve In-Context Examples for Large Language Models

Leveraging Multi-lingual Positive Instances in Contrastive Learning to Improve Sentence Embedding

Like a Good Nearest Neighbor: Practical Content Moderation and Text Classification

LLM Comparative Assessment: Zero-shot NLG Evaluation through Pairwise Comparisons using Large Language Models

Localization vs. Semantics: Visual Representations in Unimodal and Multimodal Models

Measuring Uncertainty in Neural Machine Translation with Similarity-Sensitive Entropy

More Discriminative Sentence Embeddings via Semantic Graph Smoothing

μPLAN: Summarizing using a Content Plan as Cross-Lingual Bridge

Multi-Reference Benchmarks for Russian Grammatical Error Correction

Multi-Relational Hyperbolic Word Embeddings from Natural Language Definitions

Parameter-Efficient Conversational Recommender System as a Language Processing Task

PEARL: Prompting Large Language Models to Plan and Execute Actions Over Long Documents

Polarized Opinion Detection Improves the Detection of Toxic Language

Quantifying Stereotypes in Language

Rainbow - A Benchmark for Systematic Testing of How Sensitive Visio-Linguistic Models are to Color Naming

SCO-VIST: Social Interaction Commonsense Knowledge-based Visual Storytelling

Should I try multiple optimizers when fine-tuning a pre-trained Transformer for NLP tasks? Should I tune their hyperparameters?

Smaller Language Models are Better Zero-shot Machine-Generated Text Detectors

Source Identification in Abstractive Summarization

SPUQ: Perturbation-Based Uncertainty Quantification for Large Language Models

STable: Table Generation Framework for Encoder-Decoder Models

Style-News: Incorporating Stylized News Generation and Adversarial Verification for Neural Fake News Detection

System-Level Natural Language Feedback

Threat Behavior Textual Search by Attention Graph Isomorphism

UP5: Unbiased Foundation Model for Fairness-aware Recommendation

ViLexNorm: A Lexical Normalization Corpus for Vietnamese Social Media Text

VlogQA: Task, Dataset, and Baseline Models for Vietnamese Spoken-Based Machine Reading Comprehension

VOLTAGE: A Versatile Contrastive Learning based OCR Methodology for ultra low-resource scripts through Auto Glyph Feature Extraction

Where Do We Go From Here? Multi-scale Allocentric Relational Inference from Natural Spatial Descriptions

Who Needs Decoders? Efficient Estimation of Sequence-Level Attributes with Proxies

Zero-Shot End-to-End Spoken Language Understanding via Cross-Modal Selective Self-Training

Zero-shot Sentiment Analysis in Low-Resource Languages Using a Multilingual Sentiment Lexicon