Timezone: Conference (Singapore) UTC Browser
Timezone: Conference (Singapore) UTC Browser
Timezone: Conference (Singapore) UTC Browser
Poster_Demo_Industry Hybrid 1
Poster Presentations
Demo (Poster)
Room: East Foyer(Virtual)
Dialogue and Interactive Systems (Poster)
Room: East Foyer(Virtual)
Human-Centered NLP (Poster)
Room: East Foyer(Virtual)
Industry (Poster)
Room: East Foyer(Virtual)
Information Extraction (Poster)
Room: East Foyer(Virtual)
- Unified Low-Resource Sequence Labeling by Sample-Aware Dynamic Sparse Finetuning
- MProto: Multi-Prototype Network with Denoised Optimal Transport for Distantly Supervised Named Entity Recognition
- Document-level Relationship Extraction by Bidirectional Constraints of Beta Rules
- Set Learning for Generative Information Extraction
- Abstractive Open Information Extraction
- HyperRank: Hyperbolic Ranking Model for Unsupervised Keyphrase Extraction
- T2-NER: A Two-Stage Span-based Framework For Unified Named Entity Recognition with Templates
- U-CORE: A Unified Deep Cluster-wise Contrastive Framework for Open Relation Extraction
Language Grounding to Vision, Robotics and Beyond (Poster)
Room: East Foyer(Virtual)
Multilinguality and Linguistic Diversity (Poster)
Room: East Foyer(Virtual)
NLP Applications (Poster)
Room: East Foyer(Virtual)
- AutoTrial: Prompting Language Models for Clinical Trial Design
- The Benefits of Label-Description Training for Zero-Shot Text Classification
- Content- and Topology-Aware Representation Learning for Scientific Multi-Literature
- Detecting Spoilers in Movie Reviews with External Movie Knowledge and User Networks
- GenEx: A Commonsense-aware Unified Generative Framework for Explainable Cyberbullying Detection
- PhenotypeCLIP: Phenotype-based Contrastive Learning for Medical Imaging Report Generation
Poster_Demo_Industry Hybrid 2
Poster Presentations
Demo (Poster)
Room: East Foyer(Virtual)
Industry (Poster)
Room: East Foyer(Virtual)
Information Extraction (Poster)
Room: East Foyer(Virtual)
- Generating Commonsense Counterfactuals for Stable Relation Extraction
- A Comprehensive Evaluation of Biomedical Entity Linking Models
- Addressing NER Annotation Noises with Uncertainty-Guided Tree-Structured CRFs
- TacoPrompt: A Collaborative Multi-Task Prompt Learning Method for Self-Supervised Taxonomy Completion
- Open Information Extraction via Chunks
- ScdNER: Span-Based Consistency-Aware Document-Level Named Entity Recognition
- Mitigating Over-Generation for Unsupervised Keyphrase Extraction with Heterogeneous Centrality Detection
Language Modeling and Analysis of Language Models (Poster)
Room: East Foyer(Virtual)
- Primacy Effect of ChatGPT
- Explanation Selection Using Unlabeled Data for Chain-of-Thought Prompting
- Generating Data for Symbolic Language with Large Language Models
- Privacy Implications of Retrieval-Based Language Models
- Noisy Exemplars Make Large Language Models More Robust: A Domain-Agnostic Behavioral Analysis
- Removing Backdoors in Pre-trained Models by Regularized Continual Pre-training
Machine Translation (Poster)
Room: East Foyer(Virtual)
Natural Language Generation (Poster)
Room: East Foyer(Virtual)
- Air-Decoding: Attribute Distribution Reconstruction for Decoding-Time Controllable Text Generation
- E-CORE: Emotion Correlation Enhanced Empathetic Dialogue Generation
- G-Eval: NLG Evaluation using Gpt-4 with Better Human Alignment
- $k$NN-LM Does Not Improve Open-ended Text Generation
- A Self-training Framework for Automated Medical Report Generation
NLP Applications (Poster)
Room: East Foyer(Virtual)
Sentiment Analysis, Stylistic Analysis, and Argument Mining (Poster)
Room: East Foyer(Virtual)
Poster_Demo_Industry Hybrid 3
Poster Presentations
Dialogue and Interactive Systems (Poster)
Room: East Foyer(Virtual)
- Building Persona Consistent Dialogue Agents with Offline Reinforcement Learning
- Retrieval-Generation Alignment for End-to-End Task-Oriented Dialogue System
- CoF-CoT: Enhancing Large Language Models with Coarse-to-Fine Chain-of-Thought Prompting for Multi-domain NLU Tasks
- AnyTOD: A Programmable Task-Oriented Dialog System
Industry (Poster)
Room: East Foyer(Virtual)
Phonology, Morphology, and Word Segmentation (Poster)
Room: East Foyer(Virtual)
Question Answering (Poster)
Room: East Foyer(Virtual)
Resources and Evaluation (Poster)
Room: East Foyer(Virtual)
Semantics: Lexical, Sentence level, Document Level, Textual Inference, etc. (Poster)
Room: East Foyer(Virtual)
- Towards Unsupervised Recognition of Token-level Semantic Differences in Related Documents
- Are Embedded Potatoes Still Vegetables? On the Limitations of WordNet Embeddings for Lexical Semantics
- ZEROTOP: Zero-Shot Task-Oriented Semantic Parsing using Large Language Models
- IEKG: A Commonsense Knowledge Graph for Idiomatic Expressions
- OssCSE: Overcoming Surface Structure Bias in Contrastive Learning for Unsupervised Sentence Embedding
- Finding Authentic Counterhate Arguments: A Case Study with Public Figures
- Benchmarking and Improving Text-to-SQL Generation under Ambiguity
- Zero-Shot Multi-Label Topic Inference with Sentence Encoders and LLMs
Sentiment Analysis, Stylistic Analysis, and Argument Mining (Poster)
Room: East Foyer(Virtual)
Speech and Multimodality (Poster)
Room: East Foyer(Virtual)
- Towards Noise-Tolerant Speech-Referring Video Object Segmentation: Bridging Speech and Text
- Natural Disaster Tweets Classification Using Multimodal Data
- ART: rule bAsed futuRe-inference deducTion
- Support or Refute: Analyzing the Stance of Evidence to Detect Out-of-Context Mis- and Disinformation
- End-to-End Single-Channel Speaker-Turn Aware Conversational Speech Translation
Poster_Demo_Industry_Findings In-person 1
Poster Presentations
Demo (Poster)
Room: East Foyer
- CHAMP: Efficient Annotation and Consolidation of Cluster Hierarchies
- Humanoid Agents: Platform for Simulating Human-like Generative Agents
- CHATREPORT: Democratizing Sustainability Disclosure Analysis through LLM-based Tools
- RaLLe: A Framework for Developing and Evaluating Retrieval-Augmented Large Language Models
- H2O Open Ecosystem for State-of-the-art Large Language Models
- Koala: An Index for Quantifying Overlaps with Pre-training Corpora
- Sudowoodo: a Chinese Lyric Imitation System with Source Lyrics
- ConvLab-3: A Flexible Dialogue System Toolkit Based on a Unified Data Format
- FLEEK: Factual Error Detection and Correction with Evidence Retrieved from External Knowledge
- INTELMO: Enhancing Models' Adoption of Interactive Interfaces
Dialogue and Interactive Systems (Poster)
Room: East Foyer
- Baize: An Open-Source Chat Model with Parameter-Efficient Tuning on Self-Chat Data
- DiSTRICT: Dialogue State Tracking with Retriever Driven In-Context Tuning
- Contrastive Learning for Inference in Dialogue
- Post-hoc Utterance Refining Method by Entity Mining for Faithful Knowledge Grounded Conversations
- ChatEdit: Towards Multi-turn Interactive Facial Image Editing via Dialogue
- Towards LLM-driven Dialogue State Tracking
- Turn-Level Active Learning for Dialogue State Tracking
- Multi-level Adaptive Contrastive Learning for Knowledge Internalization in Dialogue Generation
- End-to-end Task-oriented Dialogue: A Survey of Tasks, Methods, and Future Directions
- KRLS: Improving End-to-End Response Generation in Task Oriented Dialog with Reinforced Keywords Learning
- Dual-Feedback Knowledge Retrieval for Task-Oriented Dialogue Systems
- Target-oriented Proactive Dialogue Systems with Personalization: Problem Formulation and Dataset Curation
- Multi-Source Multi-Type Knowledge Exploration and Exploitation for Dialogue Generation
- Fine-grained Conversational Decoding via Isotropic and Proximal Search
- Reinforced Target-driven Conversational Promotion
- Transfer-Free Data-Efficient Multilingual Slot Labeling
- A Systematic Study of Performance Disparities in Multilingual Task-Oriented Dialogue Systems
Human-Centered NLP (Poster)
Room: East Foyer
- Towards Conceptualization of ``Fair Explanation'': Disparate Impacts of anti-Asian Hate Speech Explanations on Content Moderators
- Did You Mean...? Confidence-based Trade-offs in Semantic Parsing
- BiasX: “Thinking Slow” in Toxic Content Moderation with Explanations of Implied Social Biases
- ACTOR: Active Learning with Annotator-specific Classification Heads to Embrace Human Label Variation
- Appraising the Potential Uses and Harms of LLMs for Medical Systematic Reviews
- Confidence-based Ensembling of Perspective-aware Models
- Physician Detection of Clinical Harm in Machine Translation: Quality Estimation Aids in Reliance and Backtranslation Identifies Critical Errors
- Explaining with Contrastive Phrasal Highlighting: A Case Study in Assisting Humans to Detect Translation Differences
- What Else Do I Need to Know? The Effect of Background Information on Users’ Reliance on QA Systems
- When the Majority is Wrong: Modeling Annotator Disagreement for Subjective Tasks
- From Dissonance to Insights: Dissecting Disagreements in Rationale Construction for Case Outcome Classification
- Bridging the Gap: A Survey on Integrating (Human) Feedback for Natural Language Generation
Information Extraction (Poster)
Room: East Foyer
- Log-FGAER: Logic-Guided Fine-Grained Address Entity Recognition from Multi-Turn Spoken Dialogue
- An Iteratively Parallel Generation Method with the Pre-Filling Strategy for Document-level Event Extraction
- Empirical Study of Zero-Shot NER with ChatGPT
- SAMRank: Unsupervised Keyphrase Extraction using Self-Attention Map in BERT and GPT-2
- Revisiting Sparse Retrieval for Few-shot Entity Linking
- Vision-Enhanced Semantic Entity Recognition in Document Images via Visually-Asymmetric Consistency Learning
- NeuSTIP: A Neuro-Symbolic Model for Link and Time Prediction in Temporal Knowledge Graphs
- Improving Unsupervised Relation Extraction by Augmenting Diverse Sentence Pairs
- S2abEL: A Dataset for Entity Linking from Scientific Tables
- Joint Entity and Relation Extraction with Span Pruning and Hypergraph Neural Networks
- Exploiting Asymmetry for Synthetic Training Data Generation: SynthIE and the Case of Information Extraction
- HiddenTables and PyQTax: A Cooperative Game and Dataset For TableQA to Ensure Scale and Data Privacy Across a Myriad of Taxonomies
- Semi-automatic Data Enhancement for Document-Level Relation Extraction with Distant Supervision from Large Language Models
- Towards Building More Robust NER datasets: An Empirical Study on NER Dataset Bias from a Dataset Difficulty View
- Learning to Rank Context for Named Entity Recognition Using a Synthetic Dataset
- HyperNetwork-based Decoupling to Improve Model Generalization for Few-Shot Relation Extraction
- When Reviewers Lock Horns: Finding Disagreements in Scientific Peer Reviews
- Adaptive End-to-End Metric Learning for Zero-Shot Cross-Domain Slot Filling
Language Grounding to Vision, Robotics and Beyond (Poster)
Room: East Foyer
- MemeCap: A Dataset for Captioning and Interpreting Memes
- Incorporating Structured Representations into Pretrained Vision \& Language Models Using Scene Graphs
- From Wrong To Right: A Recursive Approach Towards Vision-Language Explanation
- Coarse-to-Fine Contrastive Learning in Image-Text-Graph Space for Improved Vision-Language Compositionality
- Referring Image Segmentation via Joint Mask Contextual Embedding Learning and Progressive Alignment Network
- Prompting Scientific Names for Zero-Shot Species Recognition
- IC3: Image Captioning by Committee Consensus
- Can Language Models Understand Physical Concepts?
- GazeVQA: A Video Question Answering Dataset for Multiview Eye-Gaze Task-Oriented Collaborations
- Semi-supervised multimodal coreference resolution in image narrations
- VLIS: Unimodal Language Models Guide Multimodal Language Generation
- Evaluating Object Hallucination in Large Vision-Language Models
- DueT: Image-Text Contrastive Transfer Learning with Dual-adapter Tuning
- EDIS: Entity-Driven Image Search over Multimodal Web Content
- APoLLo : Unified Adapter and Prompt Learning for Vision Language Models
- Symbolic Planning and Code Generation for Grounded Dialogue
- Can Language Models Laugh at YouTube Short-form Videos?
- Can Pre-trained Vision and Language Models Answer Visual Information-Seeking Questions?
- Causal Reasoning through Two Cognition Layers for Improving Generalization in Visual Question Answering
- LACMA: Language-Aligning Contrastive Learning with Meta-Actions for Embodied Instruction Following
- Weakly-Supervised Learning of Visual Relations in Multimodal Pretraining
- Evaluating Bias and Fairness in Gender-Neutral Pretrained Vision-and-Language Models
- Impressions: Visual Semiotics and Aesthetic Impact Understanding
- CLEVR-Implicit: A Diagnostic Dataset for Implicit Reasoning in Referring Expression Comprehension
- ViPE: Visualise Pretty-much Everything
- Visually-Situated Natural Language Understanding with Contrastive Reading Model and Frozen Large Language Models
- Multitask Multimodal Prompted Training for Interactive Embodied Task Completion
- GROOViST: A Metric for Grounding Objects in Visual Storytelling
- Multimodal Embodied Plan Prediction Augmented with Synthetic Embodied Dialogue
- The BLA Benchmark: Investigating Basic Language Abilities of Pre-Trained Multimodal Models
- Analyzing Modular Approaches for Visual Question Decomposition
- Emergence of Abstract State Representations in Embodied Sequence Modeling
- ACQUIRED: A Dataset for Answering Counterfactual Questions In Real-Life Videos
- Let's Think Frame by Frame with VIP: A Video Infilling and Prediction Dataset for Evaluating Video Chain-of-Thought
- A Picture is Worth a Thousand Words: Language Models Plan from Pixels
- Reader: Model-based language-instructed reinforcement learning
- UniChart: A Universal Vision-language Pretrained Model for Chart Comprehension and Reasoning
- NL2TL: Transforming Natural Languages to Temporal Logics using Large Language Models
Linguistic Theories, Cognitive Modeling, and Psycholinguistics (Poster)
Room: East Foyer
- Variance Matters: Detecting Semantic Differences without Corpus/Word Alignment
- ScanDL: A Diffusion Model for Generating Synthetic Scanpaths on Texts
- Human Learning by Model Feedback: The Dynamics of Iterative Prompting with Midjourney
- Analyzing Cognitive Plausibility of Subword Tokenization
- Revisiting the Optimality of Word Lengths
- Understanding the Role of Input Token Characters in Language Models: How Does Information Loss Affect Performance?
- Language and Mental Health: Measures of Emotion Dynamics from Text as Linguistic Biosocial Markers
- Language Model Quality Correlates with Psychometric Predictive Power in Multiple Languages
- Testing the Predictions of Surprisal Theory in 11 Languages
Multilinguality and Linguistic Diversity (Poster)
Room: East Foyer
- CompoundPiece: Evaluating and Improving Decompounding Performance of Language Models
- LLM-powered Data Augmentation for Enhanced Cross-lingual Performance
- How do languages influence each other? Studying cross-lingual data sharing during LM fine-tuning
- Global Voices, Local Biases: Socio-Cultural Prejudices across Languages
- XLM-V: Overcoming the Vocabulary Bottleneck in Multilingual Masked Language Models
- DADA: Dialect Adaptation via Dynamic Aggregation of Linguistic Rules
- LIMIT: Language Identification, Misidentification, and Translation using Hierarchical Models in 350+ Languages
- Unifying Cross-Lingual Transfer across Scenarios of Resource Scarcity
- Cross-Lingual Consistency of Factual Knowledge in Multilingual Language Models
- ZGUL: Zero-shot Generalization to Unseen Languages using Multi-source Ensembling of Language Adapters
- Evaluating and Modeling Attribution for Cross-Lingual Question Answering
- Task-Agnostic Low-Rank Adapters for Unseen English Dialects
- Better Quality Pre-training Data and T5 Models for African Languages
- Struct-XLM: A Structure Discovery Multilingual Language Model for Enhancing Cross-lingual Transfer through Reinforcement Learning
- GlobalBench: A Benchmark for Global Progress in Natural Language Processing
- ALDi: Quantifying the Arabic Level of Dialectness of Text
- Clustering Pseudo Language Family in Multilingual Translation Models with Fisher Information Matrix
- Language Varieties of Italy: Technology Challenges and Opportunities
- mGPT: Few-Shot Learners Go Multilingual
Natural Language Generation (Poster)
Room: East Foyer
NLP Applications (Poster)
Room: East Foyer
- Improving Transformer-based Program Repair Model through False Behavior Diagnosis
- POE: Process of Elimination for Multiple Choice Reasoning
- Generative Table Pre-training Empowers Models for Tabular Prediction
- Spoiler Detection as Semantic Text Matching
- GEM: Gestalt Enhanced Markup Language Model for Web Understanding via Render Tree
- Continual Named Entity Recognition without Catastrophic Forgetting
- A Generation-based Deductive Method for Math Word Problems
- GNAT: A General Narrative Alignment Tool
- Analyzing Film Adaptation through Narrative Alignment
- DALE: Generative Data Augmentation for Low-Resource Legal NLP
- StructGPT: A General Framework for Large Language Model to Reason over Structured Data
- Towards Low-Resource Automatic Program Repair with Meta-Learning and Pretrained Language Models
- Beyond Detection: A Defend-and-Summarize Strategy for Robust and Interpretable Rumor Analysis on Social Media
- Event Ontology Completion with Hierarchical Structure Evolution Networks
- COFFEE: Counterfactual Fairness for Personalized Text Generation in Explainable Recommendation
- Enhancing Textbooks with Visuals from the Web for Improved Learning
- BioT5: Enriching Cross-modal Integration in Biology with Chemical Knowledge and Natural Language Associations
- RobustGEC: Robust Grammatical Error Correction Against Subtle Context Perturbation
- Polar Ducks and Where to Find Them: Enhancing Entity Linking with Duck Typing and Polar Box Embeddings
- Text Embeddings Reveal (Almost) As Much As Text
- Modeling Legal Reasoning: LM Annotation at the Edge of Human Agreement
- An Expression Tree Decoding Strategy for Mathematical Equation Generation
- QA-NatVer: Question Answering for Natural Logic-based Fact Verification
- SCITAB: A Challenging Benchmark for Compositional Reasoning and Claim Verification on Scientific Tables
Poster_Demo_Industry_Findings In-person 2
Poster Presentations
Ethics in NLP (Poster)
Room: East Foyer
Information Extraction (Poster)
Room: East Foyer
- CQE: A Comprehensive Quantity Extractor
- Rationale-Enhanced Language Models are Better Continual Relation Learners
- Event Causality Extraction via Implicit Cause-Effect Interactions
- Biomedical Named Entity Recognition via Dictionary-based Synonym Generalization
- Open-world Semi-supervised Generalized Relation Discovery Aligned in a Real-world Setting
- A Unified View of Evaluation Metrics for Structured Prediction
- GreedyCAS: Unsupervised Scientific Abstract Segmentation with Normalized Mutual Information
- Enhancing Low-resource Fine-grained Named Entity Recognition by Leveraging Coarse-grained Datasets
- Linking Surface Facts to Large-Scale Knowledge Graphs
- Mirror: A Universal Framework for Various Information Extraction Tasks
- Lazy-k Decoding: Constrained Decoding for Information Extraction
- GLEN: General-Purpose Event Detection for Thousands of Types
- Multi-level Contrastive Learning for Script-based Character Understanding
- MailEx: Email Event and Argument Extraction
- SKD-NER: Continual Named Entity Recognition via Span-based Knowledge Distillation with Reinforcement Learning
- KEPL: Knowledge Enhanced Prompt Learning for Chinese Hypernym-Hyponym Extraction
- Taxonomy Expansion for Named Entity Recognition
Interpretability, Interactivity, and Analysis of Models for NLP (Poster)
Room: East Foyer
Language Grounding to Vision, Robotics and Beyond (Poster)
Room: East Foyer
Language Modeling and Analysis of Language Models (Poster)
Room: East Foyer
- Symbol tuning improves in-context learning in language models
- Collaborative Generative AI: Integrating GPT-k for Efficient Editing in Text-to-Image Generation
- Automatic Prompt Optimization with "Gradient Descent" and Beam Search
- CoMPosT: Characterizing and Evaluating Caricature in LLM Simulations
- Towards A Unified View of Sparse Feed-Forward Network in Pretraining Large Language Model
- An Investigation of LLMs’ Inefficacy in Understanding Converse Relations
- DEPN: Detecting and Editing Privacy Neurons in Pretrained Language Models
- Effects of sub-word segmentation on performance of transformer language models
- Adapting Language Models to Compress Contexts
- Personalized Distillation: Empowering Open-Sourced LLMs with Adaptive Learning for Code Generation
- trlX: A Framework for Large Scale Reinforcement Learning from Human Feedback
- MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions
- Aligning Large Language Models through Synthetic Feedback
- MAF: Multi-Aspect Feedback for Improving Reasoning in Large Language Models
- Does the Correctness of Factual Knowledge Matter for Factual Knowledge-Enhanced Pre-trained Language Models?
- Learning from Mistakes via Cooperative Study Assistant for Large Language Models
- Axiomatic Preference Modeling for Longform Question Answering
- Representative Demonstration Selection for In-Context Learning with Two-Stage Determinantal Point Process
- Self-Influence Guided Data Reweighting for Language Model Pre-training
- Text Rendering Strategies for Pixel Language Models
- The Effect of Scaling, Retrieval Augmentation and Form on the Factual Consistency of Language Models
- Failures Pave the Way: Enhancing Large Language Models through Tuning-free Rule Accumulation
- Structural Priming Demonstrates Abstract Grammatical Representations in Multilingual Language Models
- Data Similarity is Not Enough to Explain Language Model Performance
- Transcending Scaling Laws with 0.1% Extra Compute
- Recurrent Neural Language Models as Probabilistic Finite-state Automata
- Dynosaur: A Dynamic Growth Paradigm for Instruction-Tuning Data Curation
- Oolong: Investigating What Makes Transfer Learning Hard with Controlled Studies
- Just Ask for Calibration: Strategies for Eliciting Calibrated Confidence Scores from Language Models Fine-Tuned with Human Feedback
- Preserving Privacy Through Dememorization: An Unlearning Technique For Mitigating Memorization Risks In Language Models
- Unveiling the Implicit Toxicity in Large Language Models
- Do Language Models Have a Common Sense regarding Time? Revisiting Temporal Commonsense Reasoning in the Era of Large Language Models
- Evaluating Large Language Models on Controlled Generation Tasks
- Learning Preference Model for LLMs via Automatic Preference Data Generation
- CRaSh: Clustering, Removing, and Sharing Enhance Fine-tuning without Full Large Language Model
- The Curious Case of Hallucinatory (Un)answerability: Finding Truths in the Hidden States of Over-Confident Large Language Models
- On the Representational Capacity of Recurrent Neural Language Models
- Unnatural Error Correlation: GPT-4 Can Almost Perfectly Handle Unnatural Scrambled Text
- Adapting to the Long Tail: A Meta-Analysis of Transfer Learning Research for Language Understanding Tasks
- Compositional Zero-Shot Domain Transfer with Text-to-Text Models
- Communication Drives the Emergence of Language Universals in Neural Agents: Evidence from the Word-order/Case-marking Trade-off"
Machine Learning for NLP (Poster)
Room: East Foyer
Machine Translation (Poster)
Room: East Foyer
- Video-Helpful Multimodal Machine Translation
- GATITOS: Using a New Multilingual Lexicon for Low-resource Machine Translation
- DecoMT: Decomposed Prompting for Machine Translation Between Related Languages using Large Language Models
- Target-Agnostic Gender-Aware Contrastive Learning for Mitigating Bias in Multilingual Machine Translation
- Exploring All-In-One Knowledge Distillation Framework for Neural Machine Translation
- Nearest Neighbor Machine Translation is Meta-Optimizer on Output Projection Layer
- Multilingual \textit{k}-Nearest-Neighbor Machine Translation
- Improved Pseudo Data for Machine Translation Quality Estimation with Constrained Beam Search
- Towards a Better Understanding of Variations in Zero-Shot Neural Machine Translation Performance
- Beyond Shared Vocabulary: Increasing Representational Word Similarities across Languages for Multilingual Machine Translation
- RainProof: An Umbrella to Shield Text Generator from Out-Of-Distribution Data
- MMNMT: Modularizing Multilingual Neural Machine Translation with Flexibly Assembled MoE and Dense Blocks
- Rethinking Word-Level Auto-Completion in Computer-Aided Translation
- PROSE: A Pronoun Omission Solution for Chinese-English Spoken Language Translation
- Condensing Multilingual Knowledge with Lightweight Language-Specific Modules
- Challenges in Context-Aware Neural Machine Translation
- CLAD-ST: Contrastive Learning with Adversarial Data for Robust Speech Translation
- Towards Example-Based NMT with Multi-Levenshtein Transformers
- Exploring Discourse Structure in Document-level Machine Translation
- Revisiting Source Context in Nearest Neighbor Machine Translation
- An Empirical Study of Translation Hypothesis Ensembling with Large Language Models
- Adaptive Policy with Wait-k Model for Simultaneous Translation
- MT2: Towards a Multi-Task Machine Translation Model with Translation-Specific In-Context Learning
- Bridging Background Knowledge Gaps in Translation with Automatic Explicitation
- Ties Matter: Meta-Evaluating Modern Metrics with Pairwise Accuracy and Tie Calibration
- Learn and Consolidate: Continual Adaptation for Zero-Shot and Multilingual Neural Machine Translation
- Adapting Offline Speech Translation Models for Streaming with Future-Aware Distillation and Inference
- Hallucinations in Large Multilingual Translation Models
- Rethinking the Exploitation of Monolingual Data for Low-Resource Neural Machine Translation
Natural Language Generation (Poster)
Room: East Foyer
- Self-Ensemble of $N$-best Generation Hypotheses by Lexically Constrained Decoding
- Self-Detoxifying Language Models via Toxification Reversal
- SummEdits: Measuring LLM Ability at Factual Reasoning Through The Lens of Summarization
- Beat LLMs at Their Own Game: Zero-Shot LLM-Generated Text Detection via Querying ChatGPT
- Prompting Large Language Models with Chain-of-Thought for Few-Shot Knowledge Base Question Generation
- Improving Biomedical Abstractive Summarisation with Knowledge Aggregation from Citation Papers
- Knowledge Graph Compression Enhances Diverse Commonsense Generation
- ReTAG: Reasoning Aware Table to Analytic Text Generation
- ChatGPT to Replace Crowdsourcing of Paraphrases for Intent Classification: Higher Diversity and Comparable Model Robustness
- Controlling Pre-trained Language Models for Grade-Specific Text Simplification
- Enhancing Long-form Text Generation Efficacy with Task-adaptive Tokenization
- Elaborative Simplification as Implicit Questions Under Discussion
- We Are What We Repeatedly Do: Inducing and Deploying Habitual Schemas in Persona-Based Responses
- Multilingual Simplification of Medical Texts
- Text Fact Transfer
- Fast and Accurate Factual Inconsistency Detection Over Long Documents
- Penalty Decoding: Well Suppress the Self-Reinforcement Effect in Open-Ended Text Generation
- Fidelity-Enriched Contrastive Search: Reconciling the Faithfulness-Diversity Trade-Off in Text Generation
- Lifelong Sequence Generation with Dynamic Module Expansion and Adaptation
- KCTS: Knowledge-Constrained Tree Search Decoding with Token-Level Hallucination Detection
NLP Applications (Poster)
Room: East Foyer
- Be Selfish, But Wisely: Investigating the Impact of Agent Personality in Mixed-Motive Human-Agent Interactions
- Specialist or Generalist? Instruction Tuning for Specific NLP Tasks
- FedTherapist: Mental Health Monitoring with User-Generated Linguistic Expressions on Smartphones via Federated Learning
- Interventional Rationalization
- Location-Aware Visual Question Generation with Lightweight Models
- Controllable Contrastive Generation for Multilingual Biomedical Entity Linking
- COVID-19 Vaccine Misinformation in Middle Income Countries
- ByteSized32: A Corpus and Challenge Task for Generating Task-Specific World Models Expressed as Text Games
Sentiment Analysis, Stylistic Analysis, and Argument Mining (Poster)
Room: East Foyer
- VivesDebate-Speech: A Corpus of Spoken Argumentation to Leverage Audio Features for Argument Mining
- Automatic Debate Evaluation with Argumentation Semantics and Natural Language Argument Graph Networks
- Learning Language-guided Adaptive Hyper-modality Representation for Multimodal Sentiment Analysis
- How to Enhance Causal Discrimination of Utterances: A Case on Affective Reasoning
- The Sentiment Problem: A Critical Survey towards Deconstructing Sentiment Analysis
- Characterizing and Verifying Scientific Claims: Qualitative Causal Structure is All You Need
- Target-to-Source Augmentation for Aspect Sentiment Triplet Extraction
- Exploring Jiu-Jitsu Argumentation for Writing Peer Review Rebuttals
- Comparing Styles across Languages
- SentiStream: A Co-Training Framework for Adaptive Online Sentiment Analysis in Evolving Data Streams
- Detecting Propaganda Techniques in Code-Switched Social Media Text
- $\textit{``Don't Take This Out of Context!''}$ On the Need for Contextual Models and Evaluations for Stylistic Rewriting
- Empathy Intent Drives Empathy Detection
- Hi-ArG: Exploring the Integration of Hierarchical Argumentation Graphs in Language Pretraining
- Dual-Channel Span for Aspect Sentiment Triplet Extraction
- A Training-Free Debiasing Framework with Counterfactual Reasoning for Conversational Emotion Detection
- $\textit{Lost in Translation, Found in Spans}$: Identifying Claims in Multilingual Social Media
- Stance Detection on Social Media with Background Knowledge
- Contextual Interaction for Argument Post Quality Assessment
- A Deeper (Autoregressive) Approach to Non-Convergent Discourse Parsing
- Predict the Future from the Past? On the Temporal Data Distribution Shift in Financial Sentiment Classifications
- Architectural Sweet Spots for Modeling Human Label Variation by the Example of Argument Quality: It’s Best to Relate Perspectives!
Poster_Demo_Industry_Findings In-person 3
Poster Presentations
Demo (Poster)
Room: East Foyer
Dialogue and Interactive Systems (Poster)
Room: East Foyer
- TOD-Flow: Modeling the Structure of Task-Oriented Dialogues
- Semantic Space Grounded Weighted Decoding for Multi-Attribute Controllable Dialogue Generation
- CESAR: Automatic Induction of Compositional Instructions for Multi-turn Dialogs
- Rethinking the Evaluation for Conversational Recommendation in the Era of Large Language Models
- A Framework for Vision-Language Warm-up Tasks in Multimodal Dialogue Models
- Causal Document-Grounded Dialogue Pre-training
- HutCRS: Hierarchical User-Interest Tracking for Conversational Recommender System
- Back Transcription as a Method for Evaluating Robustness of Natural Language Understanding Models to Speech Recognition Errors
- Dialogizer: Context-aware Conversational-QA Dataset Generation from Textual Sources
- Graph vs. Sequence: An Empirical Study on Knowledge Forms for Knowledge-Grounded Dialogue
- Re$^3$Dial: Retrieve, Reorganize and Rescale Conversations for Long-Turn Open-Domain Dialogue Pre-training
- TRAVEL: Tag-Aware Conversational FAQ Retrieval via Reinforcement Learning
- A Diffusion Weighted Graph Framework for New Intent Discovery
- PRESTO: A Multilingual Dataset for Parsing Realistic Task-Oriented Dialogs
- Continual Dialogue State Tracking via Example-Guided Question Answering
- Towards a Unified Conversational Recommendation System: Multi-task Learning via Contextualized Knowledge Distillation
- Well Begun is Half Done: Generator-agnostic Knowledge Pre-Selection for Knowledge-Grounded Dialogue
- Scalable-DSC: A Structural Template Prompt Approach to Scalable Dialogue State Correction
- SuperDialseg: A Large-scale Dataset for Supervised Dialogue Segmentation
- Dialogue Chain-of-Thought Distillation for Commonsense-aware Conversational Agents
- Pointwise Mutual Information Based Metric and Decoding Strategy for Faithful Generation in Document Grounded Dialogs
- P5: Plug-and-Play Persona Prompting for Personalized Response Selection
- Learning From Free-Text Human Feedback -- Collect New Datasets Or Extend Existing Ones?
- Interactive Text-to-SQL Generation via Editable Step-by-Step Explanations
- TaskDiff: A Similarity Metric for Task-Oriented Conversations
Industry (Poster)
Room: East Foyer
Information Extraction (Poster)
Room: East Foyer
Information Retrieval and Text Mining (Poster)
Room: East Foyer
Phonology, Morphology, and Word Segmentation (Poster)
Room: East Foyer
Question Answering (Poster)
Room: East Foyer
- Continually Improving Extractive QA via Human Feedback
- IfQA: A Dataset for Open-domain Question Answering under Counterfactual Presuppositions
- ReasoningLM: Enabling Structural Subgraph Reasoning in Pre-trained Language Models for Question Answering over Knowledge Graph
- Knowledge-Augmented Language Model Verification
- Answering Questions by Meta-Reasoning over Multiple Chains of Thought
- Question Answering as Programming for Solving Time-Sensitive Questions
- Interview Evaluation: A Novel Approach for Automatic Evaluation of Conversational Question Answering Models
- When Do Decompositions Help for Machine Reading?
- Let GPT be a Math Tutor: Teaching Math Word Problem Solvers with Customized Exercise Generation
- LM vs LM: Detecting Factual Errors via Cross Examination
- Query Rewriting in Retrieval-Augmented Large Language Models
- API-Assisted Code Generation for Question Answering on Varied Table Structures
- Conversational Semantic Parsing using Dynamic Context Graphs
- Large Language Models are Complex Table Parsers
- Tree of Clarifications: Answering Ambiguous Questions with Retrieval-Augmented Large Language Models
- MarkQA: A large scale KBQA dataset with numerical reasoning
- Best of Both Worlds: Towards Improving Temporal Knowledge Base Question Answering via Targeted Fact Extraction
- From Parse-Execute to Parse-Execute-Refine: Improving Semantic Parser for Complex Question Answering over Knowledge Base
- Diversify Question Generation with Retrieval-Augmented Style Transfer
- PRCA: Fitting Black-Box Large Language Models for Retrieval Question Answering via Pluggable Reward-Driven Contextual Adapter
- SEER : A Knapsack approach to Exemplar Selection for In-Context HybridQA
- Non-Autoregressive Math Word Problem Solver with Unified Tree Structure
- A Simple Baseline for Knowledge-Based Visual Question Answering
- Mitigating Temporal Misalignment by Discarding Outdated Facts
- Have LLMs Advanced Enough? A Challenging Problem Solving Benchmark For Large Language Models
- Selectively Answering Ambiguous Questions
- Active Retrieval Augmented Generation
- Fine-tuned LLMs Know More, Hallucinate Less with Few-Shot Sequence-to-Sequence Semantic Parsing over Wikidata
- DeSIQ: Towards an Unbiased, Challenging Benchmark for Social Intelligence Understanding
- Chain-of-Questions Training with Latent Answers for Robust Multistep Question Answering
- Hop, Union, Generate: Explainable Multi-hop Reasoning without Rationale Supervision
- ToolWriter: Question Specific Tool Synthesis for Tabular Data
- Exploring Contrast Consistency of Open-Domain Question Answering Systems on Minimally Edited Questions
- Intent-calibrated Self-training for Answer Selection in Open-domain Dialogues
- QAmeleon: Multilingual QA with Only 5 Examples
- Analyzing Semantic Faithfulness of Language Models via Input Intervention on Question Answering
Resources and Evaluation (Poster)
Room: East Foyer
- AfriSenti: A Twitter Sentiment Analysis Benchmark for African Languages
- HalOmi: A Manually Annotated Benchmark for Multilingual Hallucination and Omission Detection in Machine Translation
- A Suite of Generative Tasks for Multi-Level Multimodal Webpage Understanding
- Evaluating the Rationale Understanding of Critical Reasoning in Logical Reading Comprehension
- Multilingual Holistic Bias: Extending Descriptors and Patterns to Unveil Demographic Biases in Languages at Scale
- CHEF in the Language Kitchen: A Generative Data Augmentation Leveraging Korean Morpheme Ingredients
- AD-NLP: A Benchmark for Anomaly Detection in Natural Language Processing
- ORCHID: A Chinese Debate Corpus for Target-Independent Stance Detection and Argumentative Dialogue Summarization
- SLOG: A Structural Generalization Benchmark for Semantic Parsing
- Faithful Model Evaluation for Model-Based Metrics
- Enhancing Chat Language Models by Scaling High-quality Instructional Conversations
- CiteBench: A Benchmark for Scientific Citation Text Generation
- Superlim: A Swedish Language Understanding Evaluation Benchmark
- Can We Edit Multimodal Large Language Models?
- FinEntity: Entity-level Sentiment Classification for Financial Texts
- CleanCoNLL: A Nearly Noise-Free Named Entity Recognition Dataset
- EDeR: Towards Understanding Dependency Relations Between Events
- Exploring the Boundaries of GPT-4 in Radiology
- DUMB: A Dutch Model Benchmark
- MILDSum: A Novel Benchmark Dataset for Multilingual Summarization of Indian Legal Case Judgments
- Counter Turing Test (CT2): AI-Generated Text Detection is Not as Easy as You May Think - Introducing AI Detectability Index (ADI)
- FACTIFY3M: A benchmark for multimodal fact verification with explainability through 5W Question-Answering
- On the Challenges of Using Black-Box APIs for Toxicity Evaluation in Research
- Ideology Takes Multiple Looks: A High-Quality Dataset for Multifaceted Ideology Detection
- Evaluating Evaluation Metrics: A Framework for Analyzing NLG Evaluation Metrics using Measurement Theory
- CLEME: Debiasing Multi-reference Evaluation for Grammatical Error Correction
- RoBoCoP: A Comprehensive ROmance BOrrowing COgnate Package and Benchmark for Multilingual Cognate Identification
- CS2W: A Chinese Spoken-to-Written Style Conversion Dataset with Multiple Conversion Types
- StoryAnalogy: Deriving Story-level Analogies from Large Language Models to Unlock Analogical Understanding
- The ACL OCL Corpus: Advancing Open Science in Computational Linguistics
- BLESS: Benchmarking Large Language Models on Sentence Simplification
- mRedditSum: A Multimodal Abstractive Summarization Dataset of Reddit Threads with Images
- Beyond Factuality: A Comprehensive Evaluation of Large Language Models as Knowledge Generators
- A Fair and In-Depth Evaluation of Existing End-to-End Entity Linking Systems
Semantics: Lexical, Sentence level, Document Level, Textual Inference, etc. (Poster)
Room: East Foyer
- NLI4CT: Multi-Evidence Natural Language Inference for Clinical Trial Reports
- Not all quantifiers are equal: Probing Transformer-based language models' understanding of generalised quantifiers
- Cabbage Sweeter than Cake? Analysing the Potential of Large Language Models for Learning Conceptual Spaces
- Retrofitting Light-weight Language Models for Emotions using Supervised Contrastive Learning
- StrAE: Autoencoding for Pre-Trained Embeddings using Explicit Structure
- M$^3$Seg: A Maximum-Minimum Mutual Information Paradigm for Unsupervised Topic Segmentation in ASR Transcripts
- Find-2-Find: Multitask Learning for Anaphora Resolution and Object Localization
- Paraphrase Types for Generation and Detection
- Weakly Supervised Semantic Parsing with Execution-based Spurious Program Filtering
- C-STS: Conditional Semantic Textual Similarity
- BERTie Bott's Every Flavor Labels: A Tasty Introduction to Semantic Role Labeling for Galician
- Bridging Continuous and Discrete Spaces: Interpretable Sentence Representation Learning via Compositional Operations
- Exploring Chain of Thought Style Prompting for Text-to-SQL
- We're Afraid Language Models Aren't Modeling Ambiguity
- Length is a Curse and a Blessing for Document-level Semantics
- AMR Parsing with Causal Hierarchical Attention and Pointers
- To Split or Not to Split: Composing Compounds in Contextual Vector Spaces
- On Graph-based Reentrancy-free Semantic Parsing
- Optimal Transport Posterior Alignment for Cross-lingual Semantic Parsing
- ReCOGS: How Incidental Details of a Logical Form Overshadow an Evaluation of Semantic Interpretation
- Calibrated Interpretation: Confidence Estimation in Semantic Parsing
Speech and Multimodality (Poster)
Room: East Foyer
- Generative Spoken Language Model based on continuous word-sized audio tokens
- STAIR: Learning Sparse Text and Image Representation in Grounded Tokens
- DPP-TTS: Diversifying prosodic features of speech via determinantal point processes
- Improving Chinese Pop Song and Hokkien Gezi Opera Singing Voice Synthesis by Enhancing Local Modeling
- DetGPT: Detect What You Need via Reasoning
- KEBAP: Korean Error Explainable Benchmark Dataset for ASR and Post-processing
- Optimized Tokenization for Transcribed Error Correction
- Speech-enriched Memory for Inference-time Adaptation of ASR Models to Word Dictionaries
- Accented Speech Recognition With Accent-specific Codebooks
- A Challenging Multimodal Video Summary: Simultaneously Extracting and Generating Keyframe-Caption Pairs from Video
- Training Simultaneous Speech Translation with Robust and Random Wait-k-Tokens Strategy
- DiffS2UT: A Semantic Preserving Diffusion Model for Textless Direct Speech-to-Speech Translation
- Speech Recognition and Meaning Interpretation: Towards Disambiguation of Structurally Ambiguous Spoken Utterances in Indonesian
- PromptST: Abstract Prompt Learning for End-to-End Speech Translation
- Speak, Read and Prompt: High-Fidelity Text-to-Speech with Minimal Supervision
- MissModal: Increasing Robustness to Missing Modality in Multimodal Sentiment Analysis
Theme Track: Large Language Models and the Future of NLP (Poster)
Room: East Foyer
Poster_Demo_Industry_Findings Virtual 1
Poster Presentations
Commonsense Reasoning (Poster)
Room: Virtual-Gathertown
Computational Social Science and Cultural Analytics (Poster)
Room: Virtual-Gathertown
Dialogue and Interactive Systems (Poster)
Room: Virtual-Gathertown
Efficient Methods for NLP (Poster)
Room: Virtual-Gathertown
Human-Centered NLP (Poster)
Room: Virtual-Gathertown
Information Extraction (Poster)
Room: Virtual-Gathertown
Information Retrieval and Text Mining (Poster)
Room: Virtual-Gathertown
Interpretability, Interactivity, and Analysis of Models for NLP (Poster)
Room: Virtual-Gathertown
Language Grounding to Vision, Robotics and Beyond (Poster)
Room: Virtual-Gathertown
Language Modeling and Analysis of Language Models (Poster)
Room: Virtual-Gathertown
Linguistic Theories, Cognitive Modeling, and Psycholinguistics (Poster)
Room: Virtual-Gathertown
Machine Translation (Poster)
Room: Virtual-Gathertown
Multilinguality and Linguistic Diversity (Poster)
Room: Virtual-Gathertown
Natural Language Generation (Poster)
Room: Virtual-Gathertown
NLP Applications (Poster)
Room: Virtual-Gathertown
- Zero-Shot-BERT-Adapters: a Zero-Shot Pipeline for Unknown Intent Detection
- Distilling ChatGPT for Explainable Automated Student Answer Assessment
- LLM-in-the-loop: Leveraging Large Language Model for Thematic Analysis
- Legally Enforceable Hate Speech Detection for Public Forums
- Uncovering Limitations in Text-to-Image Generation: A Contrastive Approach with Structured Semantic Alignment
Phonology, Morphology, and Word Segmentation (Poster)
Room: Virtual-Gathertown
Question Answering (Poster)
Room: Virtual-Gathertown
Resources and Evaluation (Poster)
Room: Virtual-Gathertown
- LEXTREME: A Multi-Lingual and Multi-Task Benchmark for the Legal Domain
- PR-MCS: Perturbation Robust Metric for MultiLingual Image Captioning
- MultiCMET: A Novel Chinese Benchmark for Understanding Multimodal Metaphor
- LogicAttack: Adversarial Attacks for Evaluating Logical Consistency of Natural Language Inference
Semantics: Lexical, Sentence level, Document Level, Textual Inference, etc. (Poster)
Room: Virtual-Gathertown
Sentiment Analysis, Stylistic Analysis, and Argument Mining (Poster)
Room: Virtual-Gathertown
Speech and Multimodality (Poster)
Room: Virtual-Gathertown
Summarization (Poster)
Room: Virtual-Gathertown
Syntax, Parsing and their Applications (Poster)
Room: Virtual-Gathertown
Theme Track: Large Language Models and the Future of NLP (Poster)
Room: Virtual-Gathertown
- Can ChatGPT Defend its Belief in Truth? Evaluating LLM Reasoning via Debate
- Three Questions Concerning the Use of Large Language Models to Facilitate Mathematics Learning
- Can Large Language Models Fix Data Annotation Errors? An Empirical Study Using Debatepedia for Query-Focused Text Summarization
- SteerLM: Attribute Conditioned SFT as an (User-Steerable) Alternative to RLHF
- Evaluating Verifiability in Generative Search Engines
- Adaptation with Self-Evaluation to Improve Selective Prediction in LLMs
- MAPO: Boosting Large Language Model Performance with Model-Adaptive Prompt Optimization
- Probing the “Creativity” of Large Language Models: Can models produce divergent semantic association?
- Context-faithful Prompting for Large Language Models
Poster_Demo_Industry_Findings Virtual 2
Poster Presentations
Commonsense Reasoning (Poster)
Room: Virtual-Gathertown
Computational Social Science and Cultural Analytics (Poster)
Room: Virtual-Gathertown
- Who Wrote it and Why? Prompting Large-Language Models for Authorship Verification
- Multimodal Automated Fact-Checking: A Survey
- Identifying Conspiracy Theories News based on Event Relation Graph
- Cultural Compass: Predicting Transfer Learning Success in Offensive Language Detection with Cultural Features
Dialogue and Interactive Systems (Poster)
Room: Virtual-Gathertown
- Beyond Candidates : Adaptive Dialogue Agent Utilizing Persona and Knowledge
- Miracle: Towards Personalized Dialogue Generation with Latent-Space Multiple Personal Attribute Control
- WikiChat: Stopping the Hallucination of Large Language Model Chatbots by Few-Shot Grounding on Wikipedia
- Salespeople vs SalesBot: Exploring the Role of Educational Value in Conversational Recommender Systems
Discourse and Pragmatics (Poster)
Room: Virtual-Gathertown
Efficient Methods for NLP (Poster)
Room: Virtual-Gathertown
Human-Centered NLP (Poster)
Room: Virtual-Gathertown
Information Extraction (Poster)
Room: Virtual-Gathertown
Information Retrieval and Text Mining (Poster)
Room: Virtual-Gathertown
Interpretability, Interactivity, and Analysis of Models for NLP (Poster)
Room: Virtual-Gathertown
Language Grounding to Vision, Robotics and Beyond (Poster)
Room: Virtual-Gathertown
Language Modeling and Analysis of Language Models (Poster)
Room: Virtual-Gathertown
Linguistic Theories, Cognitive Modeling, and Psycholinguistics (Poster)
Room: Virtual-Gathertown
Machine Learning for NLP (Poster)
Room: Virtual-Gathertown
- SELFOOD: Self-Supervised Out-Of-Distribution Detection via Learning to Rank
- Fusing Temporal Graphs into Transformers for Time-Sensitive Question Answering
- InfoCL: Alleviating Catastrophic Forgetting in Continual Text Classification from An Information Theoretic Perspective
- Watermarking PLMs on Classification Tasks by Combining Contrastive Learning with Weight Perturbation
Machine Translation (Poster)
Room: Virtual-Gathertown
Multilinguality and Linguistic Diversity (Poster)
Room: Virtual-Gathertown
Natural Language Generation (Poster)
Room: Virtual-Gathertown
NLP Applications (Poster)
Room: Virtual-Gathertown
Question Answering (Poster)
Room: Virtual-Gathertown
- ReFSQL: A Retrieval-Augmentation Framework for Text-to-SQL Generation
- Enhancing Retrieval-Augmented Large Language Models with Iterative Retrieval-Generation Synergy
- 1-PAGER: One Pass Answer Generation and Evidence Retrieval
- GLGR: Question-aware Global-to-Local Graph Reasoning for Multi-party Dialogue Reading Comprehension
- Interpreting Answers to Yes-No Questions in User-Generated Content
Resources and Evaluation (Poster)
Room: Virtual-Gathertown
Semantics: Lexical, Sentence level, Document Level, Textual Inference, etc. (Poster)
Room: Virtual-Gathertown
Speech and Multimodality (Poster)
Room: Virtual-Gathertown
Theme Track: Large Language Models and the Future of NLP (Poster)
Room: Virtual-Gathertown
- Uniform Complexity for Text Generation
- Is ChatGPT a Good Multi-Party Conversation Solver?
- A Comprehensive Evaluation of Tool-Assisted Generation Strategies
- Emptying the Ocean with a Spoon: Should We Edit Models?
- Towards Concept-Aware Large Language Models
- Unraveling Downstream Gender Bias from Large Language Models: A Study on AI Educational Writing Assistance
Poster_Demo_Industry_Findings Virtual 3
Poster Presentations
Commonsense Reasoning (Poster)
Room: Virtual-Gathertown
Computational Social Science and Cultural Analytics (Poster)
Room: Virtual-Gathertown
Dialogue and Interactive Systems (Poster)
Room: Virtual-Gathertown
Discourse and Pragmatics (Poster)
Room: Virtual-Gathertown
Efficient Methods for NLP (Poster)
Room: Virtual-Gathertown
Human-Centered NLP (Poster)
Room: Virtual-Gathertown
Information Extraction (Poster)
Room: Virtual-Gathertown
Information Retrieval and Text Mining (Poster)
Room: Virtual-Gathertown
Interpretability, Interactivity, and Analysis of Models for NLP (Poster)
Room: Virtual-Gathertown
Language Grounding to Vision, Robotics and Beyond (Poster)
Room: Virtual-Gathertown
Language Modeling and Analysis of Language Models (Poster)
Room: Virtual-Gathertown
- On the Risk of Misinformation Pollution with Large Language Models
- Variator: Accelerating Pre-trained Models with Plug-and-Play Compression Modules
- Give Me the Facts! A Survey on Factual Knowledge Probing in Pre-trained Language Models
- RealBehavior: A Framework for Faithfully Characterizing Foundation Models’ Human-like Behavior Mechanisms
- RWKV: Reinventing RNNs for the Transformer Era
Linguistic Theories, Cognitive Modeling, and Psycholinguistics (Poster)
Room: Virtual-Gathertown
Machine Learning for NLP (Poster)
Room: Virtual-Gathertown
- Orthogonal Subspace Learning for Language Model Continual Learning
- Uncovering the Root of Hate Speech: A Dataset for Identifying Hate Instigating Speech
- Towards Informative Few-Shot Prompt with Maximum Information Gain for In-Context Learning
- HiCL: Hierarchical Contrastive Learning of Unsupervised Sentence Embeddings
- Entity Disambiguation on a Tight Labeling Budget
- RoAST: Robustifying Language Models via Adversarial Perturbation with Selective Training
Machine Translation (Poster)
Room: Virtual-Gathertown
NLP Applications (Poster)
Room: Virtual-Gathertown
- Identifying {Early Maladaptive Schemas} from Mental Health Question Texts
- Pre-training Multi-task Contrastive Learning Models for Scientific Literature Understanding
- Medical Text Simplification: Optimizing for Readability with Unlikelihood Training and Reranked Beam Search Decoding
- Explainable Claim Verification via Knowledge-Grounded Reasoning with Large Language Models
Resources and Evaluation (Poster)
Room: Virtual-Gathertown
- Are Personalized Stochastic Parrots More Dangerous? Evaluating Persona Biases in Dialogue Systems
- MultiCoNER v2: a Large Multilingual dataset for Fine-grained and Noisy Named Entity Recognition
- K-HATERS: A Hate Speech Detection Corpus in Korean with Target-Specific Ratings
- Finding Common Ground: Annotating and Predicting Common Ground in Spoken Conversations
Semantics: Lexical, Sentence level, Document Level, Textual Inference, etc. (Poster)
Room: Virtual-Gathertown
Sentiment Analysis, Stylistic Analysis, and Argument Mining (Poster)
Room: Virtual-Gathertown
Speech and Multimodality (Poster)
Room: Virtual-Gathertown
Summarization (Poster)
Room: Virtual-Gathertown
Theme Track: Large Language Models and the Future of NLP (Poster)
Room: Virtual-Gathertown
- ParroT: Translating during Chat using Large Language Models tuned with Human Translation and Feedback
- SAC$^3$: Reliable Hallucination Detection in Black-Box Language Models via Semantic-aware Cross-check Consistency
- How Predictable Are Large Language Model Capabilities? A Case Study on BIG-bench
- Empowering Psychotherapy with Large Language Models: Cognitive Distortion Detection through Diagnosis of Thought Prompting
- Hi-ToM: A Benchmark for Evaluating Higher-Order Theory of Mind Reasoning in Large Language Models
- TELeR: A General Taxonomy of LLM Prompts for Benchmarking Complex Tasks
Session 2
Oral Presentations
Computational Social Science and Cultural Analytics (Oral)
Room: West 2
- Towards Interpretable Mental Health Analysis with Large Language Models
- A Diachronic Analysis of Paradigm Shifts in NLP Research: When, How, and Why?
- The Skipped Beat: A Study of Sociopragmatic Understanding in LLMs for 64 Languages
- Rumor Detection on Social Media with Crowd Intelligence and ChatGPT-Assisted Networks
- Cultural Concept Adaptation on Multimodal Reasoning
- Reformulating NLP tasks to Capture Longitudinal Manifestation of Language Disorders in People with Dementia.
Dialogue and Interactive Systems 1 (Oral)
Room: West 3
- Learning Retrieval Augmentation for Personalized Dialogue Generation
- Prompt-Based Monte-Carlo Tree Search for Goal-oriented Dialogue Policy Planning
- MADNet: Maximizing Addressee Deduction Expectation for Multi-Party Conversation Generation
- SODA: Million-scale Dialogue Distillation with Social Commonsense Contextualization
- Conversation Chronicles: Towards Diverse Temporal and Relational Dynamics in Multi-Session Conversations
- Enhancing Code-Switching for Cross-lingual SLU: A Unified View of Semantic and Grammatical Coherence
Information Extraction 1 (Oral)
Room: East
- Cross-Document Event Coreference Resolution on Discourse Structure
- Anaphor Assisted Document-Level Relation Extraction
- RAPL: A Relation-Aware Prototype Learning Approach for Few-Shot Document-Level Relation Extraction
- Guideline Learning for In-Context Information Extraction
- Preserving Knowledge Invariance: Rethinking Robustness Evaluation of Open Information Extraction
- Instruct and Extract: Instruction Tuning for On-Demand Information Extraction
Machine Translation (Oral)
Room: West 1
- Multilingual Pixel Representations for Translation and Effective Cross-lingual Transfer
- Non-autoregressive Streaming Transformer for Simultaneous Translation
- IMTLab: An Open-Source Platform for Building, Evaluating, and Diagnosing Interactive Machine Translation Systems
- Integrating Language Models into Direct Speech Translation: An Inference-Time Solution to Control Gender Inflection
- Crossing the Threshold: Idiomatic Machine Translation through Retrieval Augmentation and Loss Weighting
- Understanding and Detecting Hallucinations in Neural Machine Translation via Model Introspection
Session 3
Oral Presentations
Commonsense Reasoning (Oral)
Room: Central 1
- Vera: A General-Purpose Plausibility Estimation Model for Commonsense Statements
- Large Language Models Can Self-Improve
- CRoW: Benchmarking Commonsense Reasoning in Real-World Tasks
- DialCoT Meets PPO: Decomposing and Exploring Reasoning Paths in Smaller Language Models
- GD-COMET: A Geo-Diverse Commonsense Inference Model
- Crystal: Introspective Reasoners Reinforced with Self-Feedback
Discourse and Pragmatics (Oral)
Room: East
- COHESENTIA: A Novel Benchmark of Incremental versus Holistic Assessment of Coherence in Generated Texts
- Improving Long Document Topic Segmentation Models With Enhanced Coherence Modeling
- Improving Dialogue Discourse Parsing via Reply-to Structures of Addressee Recognition
- QUDeval: The Evaluation of Questions Under Discussion Discourse Parsing
- Seq2seq is All You Need for Coreference Resolution
- Prompt-based Logical Semantics Enhancement for Implicit Discourse Relation Recognition
Efficient Methods for NLP 1 (Oral)
Room: Central 3
- Learning to Predict Task Transferability via Soft Prompt
- Byte Pair Encoding for Symbolic Music
- Understanding the Effect of Model Compression on Social Bias in Large Language Models
- Knowledge Distillation ≈ Label Smoothing: Fact or Fallacy?
- The Framework Tax: Disparities Between Inference Efficiency in NLP Research and Deployment
- Making Large Language Models Better Data Creators
Ethics in NLP (Oral)
Room: West 1
- TrojanSQL: SQL Injection against Natural Language Interface to Database
- ToViLaG: Your Visual-Language Generative Model is Also An Evildoer
- ROBBIE: Robust Bias Evaluation of Large Generative Language Models
- We are Who We Cite: Bridges of Influence Between Natural Language Processing and Other Academic Fields
- Deciphering Stereotypes in Pre-Trained Language Models
- Copyright Violations and Large Language Models
Information Extraction 2 (Oral)
Room: West 3
- ViStruct: Visual Structural Knowledge Extraction via Curriculum Guided Code-Vision Representation
- CorefPrompt: Prompt-based Event Coreference Resolution by Measuring Event Type and Argument Compatibilities
- Continual Event Extraction with Semantic Confusion Rectification
- CORE: A Few-Shot Company Relation Classification Dataset for Robust Domain Adaptation.
- SpEL: Structured Prediction for Entity Linking
- TIMELINE: Exhaustive Annotation of Temporal Relations Supporting the Automatic Ordering of Events in News Articles
Phonology, Morphology, and Word Segmentation (Oral)
Room: West 2
- Cognate Transformer for Automated Phonological Reconstruction and Cognate Reflex Prediction
- Understanding Compositional Data Augmentation in Typologically Diverse Morphological Inflection
- TopWORDS-Poetry: Simultaneous Text Segmentation and Word Discovery for Classical Chinese Poetry via Bayesian Inference
- Improved Unsupervised Chinese Word Segmentation Using Pre-trained Knowledge and Pseudo-labeling Transfer
- Exploring Linguistic Probes for Morphological Inflection
- On the Role of Morphological Information for Contextual Lemmatization
Session 4
Oral Presentations
Dialogue and Interactive Systems 2 (Oral)
Room: West 3
- Just Adjust One Prompt: Enhancing In-Context Dialogue Scoring via Constructing the Optimal Subgraph of Demonstrations and Prompts
- An Integrative Survey on Mental Health Conversational Agents to Bridge Computer Science and Medical Perspectives
- From Multilingual Complexity to Emotional Clarity: Leveraging Commonsense to Unveil Emotions in Code-Mixed Dialogues
- e-THERAPIST: I suggest you to cultivate a mindset of positivity and nurture uplifting thoughts
- ReSee: Responding through Seeing Fine-grained Visual Knowledge in Open-domain Dialogue
- PK-ICR: Persona-Knowledge Interactive Multi-Context Retrieval for Grounded Dialogue
Information Retrieval and Text Mining (Oral)
Room: West 1
- Hybrid Inverted Index Is a Robust Accelerator for Dense Retrieval
- Robust Prompt Optimization for Large Language Models Against Distribution Shifts
- WSDMS: Debunk Fake News via Weakly Supervised Detection of Misinforming Sentences with Contextualized Social Wisdom
- Learning to Describe for Predicting Zero-shot Drug-Drug Interactions
- Is ChatGPT Good at Search? Investigating Large Language Models as Re-Ranking Agents
- Goal-Driven Explainable Clustering via Language Descriptions
Interpretability, Interactivity, and Analysis of Models for NLP 1 (Oral)
Room: East
- Dissecting Recall of Factual Associations in Auto-Regressive Language Models
- Interpreting Embedding Spaces by Conceptualization
- Norm of Word Embedding Encodes Information Gain
- Assessing Step-by-Step Reasoning against Lexical Negation: A Case Study on Syllogism
- Can LLMs Facilitate Interpretation of Pre-trained Language Models?
- Can You Follow Me? Testing Situational Understanding for ChatGPT
Language Grounding to Vision, Robotics and Beyond (Oral)
Room: Central 1
- Models See Hallucinations: Evaluating the Factuality in Video Captioning
- Describe Me an Auklet: Generating Grounded Perceptual Category Descriptions
- Reading Books is Great, But Not if You Are Driving! Visually Grounded Reasoning about Defeasible Commonsense Norms
- Bridging the Digital Divide: Performance Variation across Socio-Economic Factors in Vision-Language Models
- 3DRP-Net: 3D Relative Position-aware Network for 3D Visual Grounding
- Localizing Active Objects from Egocentric Vision with Symbolic World Knowledge
Language Modeling and Analysis of Language Models 1 (Oral)
Room: Central 3
- FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation
- ViSoBERT: A Pre-Trained Language Model for Vietnamese Social Media Text Processing
- Navigating the Grey Area: How Expressions of Uncertainty and Overconfidence Affect Language Models
- CodeT5+: Open Code Large Language Models for Code Understanding and Generation
- MAGNIFICo: Evaluating the In-Context Learning Ability of Large Language Models to Generalize to Novel Interpretations
- CombLM: Adapting Black-Box Language Models through Small Fine-Tuned Models
Linguistic Theories, Cognitive Modeling and Psycholinguistics (Oral)
Room: West 2
- Information Value: Measuring Utterance Predictability as Distance from Plausible Alternatives
- Addressing Linguistic Bias through a Contrastive Analysis of Academic Writing in the NLP Domain
- The neural dynamics of word recognition and integration
- Quantifying the redundancy between prosody and text
- Interpreting and Exploiting Functional Specialization in Multi-Head Attention under Multi-task Learning
- Assessing the influence of attractor-verb distance on grammatical agreement in humans and language models
Timezone: Conference (Singapore) UTC Browser
Poster_Demo_Industry Hybrid 4
Poster Presentations
Dialogue and Interactive Systems (Poster)
Room: East Foyer(Virtual)
Efficient Methods for NLP (Poster)
Room: East Foyer(Virtual)
- TLM: Token-Level Masking for Transformers
- Simple and Effective Input Reformulations for Translation
- Federated Learning of Large Language Models with Parameter-Efficient Prompt Tuning and Adaptive Optimization
- Self-Evolution Learning for Mixup: Enhance Data Augmentation on Few-Shot Text Classification Tasks
- Optimizing Retrieval-augmented Reader Models via Token Elimination
- Efficient Classification of Long Documents via State-Space Models
- Bootstrapping Small \& High Performance Language Models with Unmasking-Removal Training Policy
Industry (Poster)
Room: East Foyer(Virtual)
Resources and Evaluation (Poster)
Room: East Foyer(Virtual)
- Human Raters Cannot Distinguish English Translations from Original English Texts
- TheoremQA: A Theorem-driven Question Answering Dataset
- Revisiting De-Identification of Electronic Medical Records: Evaluation of Within- and Cross-Hospital Generalization
- DiNeR: A Large Realistic Dataset for Evaluating Compositional Generalization
- Somali Information Retrieval Corpus: Bridging the Gap between Query Translation and Dedicated Language Resources
- Holistic Inter-Annotator Agreement and Corpus Coherence Estimation in a Large-scale Multilingual Annotation Campaign
- More Than Spoken Words: Nonverbal Message Extraction and Generation
- MAUD: An Expert-Annotated Legal NLP Dataset for Merger Agreement Understanding
- A Benchmark for Reasoning with Spatial Prepositions
Semantics: Lexical, Sentence level, Document Level, Textual Inference, etc. (Poster)
Room: East Foyer(Virtual)
Sentiment Analysis, Stylistic Analysis, and Argument Mining (Poster)
Room: East Foyer(Virtual)
Speech and Multimodality (Poster)
Room: East Foyer(Virtual)
Theme Track: Large Language Models and the Future of NLP (Poster)
Room: East Foyer(Virtual)
- SeqXGPT: Sentence-Level AI-Generated Text Detection
- Character-LLM: A Trainable Agent for Role-Playing
- UPRISE: Universal Prompt Retrieval for Improving Zero-Shot Evaluation
- On the Benefits of Learning to Route in Mixture-of-Experts Models
- MoPe: Model Perturbation based Privacy Attacks on Language Models
Poster_Demo_Industry Hybrid 5
Poster Presentations
Demo (Poster)
Room: East Foyer(Virtual)
- Descriptive Knowledge Graph in Biomedical Domain
- ZhuJiu: A Multi-dimensional, Multi-faceted Chinese Benchmark for Large Language Models
- EfficientOCR: An Extensible, Open-Source Package for Efficiently Digitizing World Knowledge
- ModelScope-Agent: Building Your Customizable Agent System with Open-source Large Language Models
Efficient Methods for NLP (Poster)
Room: East Foyer(Virtual)
Ethics in NLP (Poster)
Room: East Foyer(Virtual)
- Fair Without Leveling Down: A New Intersectional Fairness Definition
- Discourse Structures Guided Fine-grained Propaganda Identification
- StereoMap: Quantifying the Awareness of Human-like Stereotypes in Large Language Models
- Centering the Margins: Outlier-Based Identification of Harmed Populations in Toxicity Detection
Industry (Poster)
Room: East Foyer(Virtual)
Interpretability, Interactivity, and Analysis of Models for NLP (Poster)
Room: East Foyer(Virtual)
- This Reads Like That: Deep Learning for Interpretable Natural Language Processing
- Cognitive Dissonance: Why Do Language Model Outputs Disagree with Internal Representations of Truthfulness?
- A State-Vector Framework for Dataset Effects
- Understanding the Inner-workings of Language Models Through Representation Dissimilarity
- Do Transformers Parse while Predicting the Masked Word?
Summarization (Poster)
Room: East Foyer(Virtual)
- GEMINI: Controlling The Sentence-Level Summary Style in Abstractive Text Summarization
- Boosting Summarization with Normalizing Flows and Aggressive Training
- Generating Summaries with Controllable Readability Levels
- Improving Summarization with Human Edits
- FactKB: Generalizable Factuality Evaluation using Language Models Enhanced with Factual Knowledge
- Towards Interpretable and Efficient Automatic Reference-Based Summarization Evaluation
Theme Track: Large Language Models and the Future of NLP (Poster)
Room: East Foyer(Virtual)
- Why LLMs Hallucinate, and How to Get (Evidential) Closure: Perceptual, Intensional, and Extensional Learning for Faithful Natural Language Generation
- Hallucination Detection for Generative Large Language Models by Bayesian Sequential Estimation
- Polyglot or Not? Measuring Multilingual Encyclopedic Knowledge in Foundation Models
- Evaluation of African American Language Bias in Natural Language Generation
Poster_Demo_Industry_Findings In-person 4
Poster Presentations
Commonsense Reasoning (Poster)
Room: East Foyer
- Knowledge Rumination for Pre-trained Language Models
- IAG: Induction-Augmented Generation Framework for Answering Reasoning Questions
- DIVE: Towards Descriptive and Diverse Visual Commonsense Generation
- From Heuristic to Analytic: Cognitively Motivated Strategies for Coherent Physical Commonsense Reasoning
- Are All Steps Equally Important? Benchmarking Essentiality Detection in Event Processes
- Editing Common Sense in Transformers
- LINC: A Neurosymbolic Approach for Logical Reasoning by Combining Language Models with First-Order Logic Provers
- What Makes it Ok to Set a Fire? Iterative Self-distillation of Contexts and Rationales for Disambiguating Defeasible Social and Moral Situations
- CAR: Conceptualization-Augmented Reasoner for Zero-Shot Commonsense Question Answering
Demo (Poster)
Room: East Foyer
- LM-Polygraph: Uncertainty Estimation for Language Models
- prompterator: Iterate efficiently towards more effective prompts
- PaperMage: A Unified Toolkit for Processing, Representing, and Manipulating Visually-Rich Scientific Documents
- OmniEvent: A Comprehensive, Fair, and Easy-to-Use Toolkit for Event Understanding
Dialogue and Interactive Systems (Poster)
Room: East Foyer
Efficient Methods for NLP (Poster)
Room: East Foyer
- Active Instruction Tuning: Improving Cross-Task Generalization by Training on Prompt Sensitive Tasks
- SMoP: Towards Efficient and Effective Prompt Tuning with Sparse Mixture-of-Prompts
- Influence Scores at Scale for Efficient Language Data Sampling
- Fast and Robust Early-Exiting Framework for Autoregressive Language Models with Synchronized Parallel Decoding
- Simple Temporal Adaptation to Changing Label Sets: Hashtag Prediction via Dense KNN
- Token Prediction as Implicit Classification to Identify LLM-Generated Text
- Combining Denoising Autoencoders with Contrastive Learning to fine-tune Transformer Models
- Sparse Universal Transformer
- Towards Robust Pruning: An Adaptive Knowledge-Retention Pruning Strategy for Language Models
- Enhancing Computation Efficiency in Large Language Models through Weight and Activation Quantization
- Sparse Low-rank Adaptation of Pre-trained Language Models
- An Efficient Self-Supervised Cross-View Training For Sentence Embedding
Ethics in NLP (Poster)
Room: East Foyer
Information Extraction (Poster)
Room: East Foyer
- Always the Best Fit: Adaptive Domain Gap Filling from Causal Perspective for Few-Shot Relation Extraction
- On Event Individuation for Document-Level Information Extraction
- EconBERTa: Towards Robust Extraction of Named Entities in Economics
- CASSI: Contextual and Semantic Structure-based Interpolation Augmentation for Low-Resource NER
- Large Language Model Is Not a Good Few-shot Information Extractor, but a Good Reranker for Hard Samples!
Interpretability, Interactivity, and Analysis of Models for NLP (Poster)
Room: East Foyer
Language Modeling and Analysis of Language Models (Poster)
Room: East Foyer
Linguistic Theories, Cognitive Modeling, and Psycholinguistics (Poster)
Room: East Foyer
Machine Learning for NLP (Poster)
Room: East Foyer
Machine Translation (Poster)
Room: East Foyer
- $\textit{SelectNoise:}$ Unsupervised Noise Injection to Enable Zero-Shot Machine Translation for Extremely Low-resource Languages
- Non-Autoregressive Document-Level Machine Translation
- Epsilon Sampling Rocks: Investigating Sampling Strategies for Minimum Bayes Risk Decoding for Machine Translation
- Simultaneous Machine Translation with Tailored Reference
- CTQScorer: Combining Multiple Features for In-context Example Selection for Machine Translation
Multilinguality and Linguistic Diversity (Poster)
Room: East Foyer
Natural Language Generation (Poster)
Room: East Foyer
NLP Applications (Poster)
Room: East Foyer
Resources and Evaluation (Poster)
Room: East Foyer
- Reduce Human Labor On Evaluating Conversational Information Retrieval System: A Human-Machine Collaboration Approach
- CRAB: Assessing the Strength of Causal Relationships Between Real-world Events
- Chinese Lexical Substitution: Dataset and Method
- Countering Misinformation via Emotional Response Generation
- SEAHORSE: A Multilingual, Multifaceted Dataset for Summarization Evaluation
- NormDial: A Comparable Bilingual Synthetic Dialog Dataset for Modeling Social Norm Adherence and Violation
- CLAIR: Evaluating Image Captions with Large Language Models
- M4: A Multi-Level, Multi-Task, and Multi-Domain Medical Benchmark for Language Model Evaluation
- Calc-X and Calcformers: Empowering Arithmetical Chain-of-Thought through Interaction with Symbolic Systems
- Larger Probes Tell a Different Story: Extending Psycholinguistic Datasets Via In-Context Learning
- ReCEval: Evaluating Reasoning Chains via Correctness and Informativeness
- DUnE: Dataset for Unified Editing
- Dancing Between Success and Failure: Edit-level Simplification Evaluation using SALSA
- TCFLE-8: a Corpus of Learner Written Productions for French as a Foreign Language and its Application to Automated Essay Scoring
- Hi Guys or Hi Folks? Benchmarking Gender-Neutral Machine Translation with the GeNTE Corpus
- Using Artificial French Data to Understand the Emergence of Gender Bias in Transformer Language Models
- Construction Artifacts in Metaphor Identification Datasets
- Increasing Coverage and Precision of Textual Information in Multilingual Knowledge Graphs
- ALCUNA: Large Language Models Meet New Knowledge
- CRT-QA: A Dataset of Complex Reasoning Question Answering over Tabular Data
- EtiCor: Corpus for Analyzing LLMs for Etiquettes
- BRAINTEASER: Lateral Thinking Puzzles for Large Language Models
- SciRepEval: A Multi-Format Benchmark for Scientific Document Representations
- VECHR: A Dataset for Explainable and Robust Classification of Vulnerability Type in the European Court of Human Rights
- Is ChatGPT a General-Purpose Natural Language Processing Task Solver?
- PASTA: A Dataset for Modeling PArticipant STAtes in Narratives
- Cross-functional Analysis of Generalisation in Behavioural Learning
- Benchmarking the Generation of Fact Checking Explanations
- Benchmarking Large Language Models for News Summarization
- DMDD: A Large-Scale Dataset for Dataset Mentions Detection
- AfriSpeech-200: Pan-African Accented Speech Dataset for Clinical and General Domain ASR
- Discover, Explain, Improve: An Automatic Slice Detection Benchmark for Natural Language Processing
- AmbiFC: Fact-Checking Ambiguous Claims with Evidence
- Multi3WOZ: A Multilingual, Multi-Domain, Multi-Parallel Dataset for Training and Evaluating Culturally Adapted Task-Oriented Dialog Systems
- Cross-lingual Open-Retrieval Question Answering for African Languages
- QUADRo: Dataset and Models for QUestion-Answer Database Retrieval
- IRFL: Image Recognition of Figurative Language
- Automatic Analysis of Substantiation in Scientific Peer Reviews
- ECHo: A Visio-Linguistic Dataset for Event Causality Inference via Human-Centric Reasoning
Semantics: Lexical, Sentence level, Document Level, Textual Inference, etc. (Poster)
Room: East Foyer
Sentiment Analysis, Stylistic Analysis, and Argument Mining (Poster)
Room: East Foyer
- A Self-enhancement Multitask Framework for Unsupervised Aspect Category Detection
- TATA: Stance Detection via Topic-Agnostic and Topic-Aware Embeddings
- Introducing Rhetorical Parallelism Detection: A New Task with Datasets, Metrics, and Baselines
- Tagging-Assisted Generation Model with Encoder and Decoder Supervision for Aspect Sentiment Triplet Extraction
- M2DF: Multi-grained Multi-curriculum Denoising Framework for Multimodal Aspect-based Sentiment Analysis
- Adapt in Contexts: Retrieval-Augmented Domain Adaptation via In-Context Learning
- SOUL: Towards Sentiment and Opinion Understanding of Language
- Standardizing Distress Analysis: Emotion-Driven Distress Identification and Cause Extraction (DICE) in Multimodal Online Posts
- Do Differences in Values Influence Disagreements in Online Discussions?
- Elevating Code-mixed Text Handling through Auditory Information of Words
- Supervised Gradual Machine Learning for Aspect-Term Sentiment Analysis
- Retrieval-Augmented Few-shot Text Classification
- Argument mining as a multi-hop generative machine reading comprehension task
Speech and Multimodality (Poster)
Room: East Foyer
Syntax, Parsing and their Applications (Poster)
Room: East Foyer
- Efficient Algorithms for Recognizing Weighted Tree-Adjoining Languages
- Pushdown Layers: Encoding Recursive Structure in Transformer Language Models
- The Generalized Left-Corner Transformation
- Learning to Paraphrase Sentences to Different Complexity Levels
- SmartSpanNER: Making SpanNER Robust in Low Resource Scenarios
Theme Track: Large Language Models and the Future of NLP (Poster)
Room: East Foyer
- SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models
- Large Language Models are Temporal and Causal Reasoners for Video Question Answering
- Synthetic Data Generation with Large Language Models for Text Classification: Potential and Limitations
- Skill-Based Few-Shot Selection for In-Context Learning
- MoT: Memory-of-Thought Enables ChatGPT to Self-Improve
- How Do Large Language Models Capture the Ever-changing World Knowledge? A Review of Recent Advances
- Explore-Instruct: Enhancing Domain-Specific Instruction Coverage through Active Exploration
- EasyQuant: An Efficient Data-free Quantization Algorithm for LLMs
- Do All Languages Cost the Same? Tokenization in the Era of Commercial Language Models
- Editing Large Language Models: Problems, Methods, and Opportunities
- Revisiting Automated Topic Model Evaluation with Large Language Models
- Prompting with Pseudo-Code Instructions
- Large Language Models Meet Open-World Intent Discovery and Recognition: An Evaluation of ChatGPT
- Enhancing Uncertainty-Based Hallucination Detection with Stronger Focus
- The Troubling Emergence of Hallucination in Large Language Models - An Extensive Definition, Quantification, and Prescriptive Remediations
- API-Bank: A Comprehensive Benchmark for Tool-Augmented LLMs
- Document-Level Machine Translation with Large Language Models
- Shall We Pretrain Autoregressive Language Models with Retrieval? A Comprehensive Study
- clembench: Using Game Play to Evaluate Chat-Optimized Language Models as Conversational Agents
- Task-Level Thinking Steps Help Large Language Models for Challenging Classification Task
- Establishing Trustworthiness: Rethinking Tasks and Model Evaluation
- Democratizing Reasoning Ability: Tailored Learning from Large Language Model
- Defining a New NLP Playground
- Evaluating the Knowledge Base Completion Potential of GPT
- Towards large language model-based personal agents in the enterprise: Current trends and open problems
Poster_Demo_Industry_Findings In-person 5
Poster Presentations
Computational Social Science and Cultural Analytics (Poster)
Room: East Foyer
Dialogue and Interactive Systems (Poster)
Room: East Foyer
Efficient Methods for NLP (Poster)
Room: East Foyer
- Adaptive Gating in Mixture-of-Experts based Language Models
- Exploring the Impact of Model Scaling on Parameter-Efficient Tuning
- ATFormer: A Learned Performance Model with Transfer Learning Across Devices for Deep Learning Tensor Programs
- LLM-FP4: 4-Bit Floating-Point Quantized Transformers
- Are Compressed Language Models Less Subgroup Robust?
- Vicinal Risk Minimization for Few-Shot Cross-lingual Transfer in Abusive Language Detection
- Compressing Context to Enhance Inference Efficiency of Large Language Models
- Context Compression for Auto-regressive Transformers with Sentinel Tokens
- LLM-Adapters: An Adapter Family for Parameter-Efficient Fine-Tuning of Large Language Models
- HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts
- Anchoring Fine-tuning of Sentence Transformer with Semantic Label Information for Efficient Truly Few-shot Classification
- A Frustratingly Easy Post-Training Quantization Scheme for LLMs
- Outlier Suppression+: Accurate quantization of large language models by equivalent and effective shifting and scaling
- Faster Minimum Bayes Risk Decoding with Confidence-based Pruning
- Revisiting Block-based Quantisation: What is Important for Sub-8-bit LLM Inference?
- Parameter-efficient Tuning for Large Language Model without Calculating Its Gradients
- Leap-of-Thought: Accelerating Transformers via Dynamic Token Routing
- Prototype-based HyperAdapter for Sample-Efficient Multi-task Tuning
- Universal Self-Adaptive Prompting
- Accelerating Toeplitz Neural Network with Constant-time Inference Complexity
- Enhancing Scalability of Pre-trained Language Models via Efficient Parameter Sharing
- Let's Synthesize Step by Step: Iterative Dataset Synthesis with Large Language Models by Extrapolating Errors from Small Models
Ethics in NLP (Poster)
Room: East Foyer
- Mirages. On Anthropomorphism in Dialogue Systems
- MeaeQ: Mount Model Extraction Attacks with Efficient Queries
- CT-GAT: Cross-Task Generative Adversarial Attack based on Transferability
- Prompt as Triggers for Backdoor Attack: Examining the Vulnerability in Language Models
- Cross-lingual Transfer Can Worsen Bias in Sentiment Analysis
- Conceptor-Aided Debiasing of Large Language Models
- A Rose by Any Other Name would not Smell as Sweet: Social Bias in Names Mistranslation
- Comparing Biases and the Impact of Multilingual Training across Multiple Languages
- A Predictive Factor Analysis of Social Biases and Task-Performance in Pretrained Masked Language Models
- Gender Biases in Automatic Evaluation Metrics for Image Captioning
- PEFTDebias : Capturing debiasing information using PEFTs
- A Tale of Pronouns: Interpretability Informs Gender Bias Mitigation for Fairer Instruction-Tuned Machine Translation
Information Extraction (Poster)
Room: East Foyer
Interpretability, Interactivity, and Analysis of Models for NLP (Poster)
Room: East Foyer
- Exchange-of-Thought: Enhancing Large Language Model Capabilities through Cross-Model Communication
- Can We Edit Factual Knowledge by In-Context Learning?
- Discovering Universal Geometry in Embeddings with ICA
- Disentangling Transformer Language Models as Superposed Topic Models
- Unraveling Feature Extraction Mechanisms in Neural Networks
- Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning
- Memorisation Cartography: Mapping out the Memorisation-Generalisation Continuum in Neural Machine Translation
- A Mechanistic Interpretation of Arithmetic Reasoning in Language Models using Causal Mediation Analysis
- Text encoders bottleneck compositionality in contrastive vision-language models
- "Are Your Explanations Reliable?" Investigating the Stability of LIME in Explaining Text Classifiers by Marrying XAI and Adversarial Attack
- When are Lemons Purple? The Concept Association Bias of Vision-Language Models
- MaNtLE: Model-agnostic Natural Language Explainer
- Text-Transport: Toward Learning Causal Effects of Natural Language
- A Study on Accessing Linguistic Information in Pre-Trained Language Models by Using Prompts
- IBADR: an Iterative Bias-Aware Dataset Refinement Framework for Debiasing NLU models
- Identifying Statements Crucial for Awareness of Interpretive Nonsense to Prevent Communication Breakdowns
- Outlier Dimensions Encode Task Specific Knowledge
- Rather a Nurse than a Physician - Contrastive Explanations under Investigation
- Conceptual structure coheres in human cognition but not in large language models
- Generative Adversarial Training with Perturbed Token Detection for Model Robustness
- Dynamic Top-k Estimation Consolidates Disagreement between Feature Attribution Methods
- An Attribution Method for Siamese Encoders
- Cross-Modal Conceptualization in Bottleneck Models
- Language Models with Rationality
- Towards a Mechanistic Interpretation of Multi-Step Reasoning Capabilities of Language Models
- When Language Models Fall in Love: Animacy Processing in Transformer Language Models
- Characterizing Mechanisms for Factual Recall in Language Models
- Hyperpolyglot LLMs: Cross-Lingual Interpretability in Token Embeddings
- Deep Natural Language Feature Learning for Interpretable Prediction
- Increasing Probability Mass on Answer Choices Does Not Always Improve Accuracy
- We Need to Talk About Reproducibility in NLP Model Comparison
- InterFair: Debiasing with Natural Language Feedback for Fair Interpretable Predictions
- VISIT: Visualizing and Interpreting the Semantic Information Flow of Transformers
Language Grounding to Vision, Robotics and Beyond (Poster)
Room: East Foyer
Language Modeling and Analysis of Language Models (Poster)
Room: East Foyer
- What Makes Chain-of-Thought Prompting Effective? A Counterfactual Study
- Boosting Prompt-Based Self-Training With Mapping-Free Automatic Verbalizer for Multi-Class Classification
- Subspace Chronicles: How Linguistic Information Emerges, Shifts and Interacts during Language Model Training
- Auto-Instruct: Automatic Instruction Generation and Ranking for Black-Box Language Models
Linguistic Theories, Cognitive Modeling, and Psycholinguistics (Poster)
Room: East Foyer
Machine Translation (Poster)
Room: East Foyer
Multilinguality and Linguistic Diversity (Poster)
Room: East Foyer
Natural Language Generation (Poster)
Room: East Foyer
Question Answering (Poster)
Room: East Foyer
Resources and Evaluation (Poster)
Room: East Foyer
- MathDial: A Dialogue Tutoring Dataset with Rich Pedagogical Properties Grounded in Math Reasoning Problems
- BioDEX: Large-Scale Biomedical Adverse Drug Event Extraction for Real-World Pharmacovigilance
- DiQAD: A Benchmark Dataset for Open-domain Dialogue Quality Assessment
- Dolphin: A Challenging and Diverse Benchmark for Arabic NLG
Semantics: Lexical, Sentence level, Document Level, Textual Inference, etc. (Poster)
Room: East Foyer
Sentiment Analysis, Stylistic Analysis, and Argument Mining (Poster)
Room: East Foyer
Summarization (Poster)
Room: East Foyer
- DecipherPref: Analyzing Influential Factors in Human Preference Judgments via GPT-4
- QTSumm: Query-Focused Summarization over Tabular Data
- Transformer-based Live Update Generation for Soccer Matches from Microblog Posts
- `Don't Get Too Technical with Me': A Discourse Structure-Based Framework for Automatic Science Journalism
- MediaHG: Rethinking Eye-catchy Features in Social Media Headline Generation
- Select, Prompt, Filter: Distilling Large Language Models for Summarizing Conversations
- OpenAsp: A Benchmark for Multi-document Open Aspect-based Summarization
- Detecting and Mitigating Hallucinations in Multilingual Summarisation
- FaMeSumm: Investigating and Improving Faithfulness of Medical Summarization
- Reconstruct Before Summarize: An Efficient Two-Step Framework for Condensing and Summarizing Meeting Transcripts
- EntSUMv2: Dataset, Models and Evaluation for More Abstractive Entity-Centric Summarization
- Background Summarization of Event Timelines
- Enhancing Biomedical Lay Summarisation with External Knowledge Graphs
- Accuracy is not enough: Evaluating Personalization in Summarizers
- Summarizing Multiple Documents with Conversational Structure for Meta-Review Generation
Theme Track: Large Language Models and the Future of NLP (Poster)
Room: East Foyer
- BioPlanner: Automatic Evaluation of LLMs on Protocol Planning in Biology
- Plan, Verify and Switch: Integrated Reasoning with Diverse X-of-Thoughts
- MolCA: Molecular Graph-Language Modeling with Cross-Modal Projector and Uni-Modal Adapter
- Regulation and NLP (RegNLP): Taming Large Language Models
- Self-ICL: Zero-Shot In-Context Learning with Self-Generated Demonstrations
- The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning
- Improving Diversity of Demographic Representation in Large Language Models via Collective-Critiques and Self-Voting
- “Mistakes Help Us Grow”: Facilitating and Evaluating Growth Mindset Supportive Language in Classrooms
- Theory of Mind for Multi-Agent Collaboration via Large Language Models
- Enabling Large Language Models to Generate Text with Citations
- The Shifted and The Overlooked: A Task-oriented Investigation of User-GPT Interactions
- The Past, Present and Better Future of Feedback Learning in Large Language Models for Subjective Human Preferences and Values
- Ignore This Title and HackAPrompt: Exposing Systemic Vulnerabilities of LLMs Through a Global Prompt Hacking Competition
- GPT-RE: In-context Learning for Relation Extraction using Large Language Models
- MEGA: Multilingual Evaluation of Generative AI
- Can Large Language Models Capture Dissenting Human Voices?
- CRUSH4SQL: Collective Retrieval Using Schema Hallucination For Text2SQL
- Small Language Models Fine-tuned to Coordinate Larger Language Models improve Complex Reasoning
- INFORM : Information eNtropy based multi-step reasoning FOR large language Models
- Towards Reliable Misinformation Mitigation: Generalization, Uncertainty, and GPT-4
- Doolittle: Benchmarks and Corpora for Academic Writing Formalization
- Automatic Prompt Augmentation and Selection with Chain-of-Thought from Labeled Data
- Towards A Holistic Landscape of Situated Theory of Mind in Large Language Models
Poster_Demo_Industry_Findings Virtual 4
Poster Presentations
Computational Social Science and Cultural Analytics (Poster)
Room: Virtual-Gathertown
Dialogue and Interactive Systems (Poster)
Room: Virtual-Gathertown
- Cue-CoT: Chain-of-thought Prompting for Responding to In-depth Dialogue Questions with LLMs
- Large Language Models as Source Planner for Personalized Knowledge-grounded Dialogues
- Multi-Task Learning of Query Generation and Classification for Generative Conversational Question Rewriting
- Improving Factual Consistency for Knowledge-Grounded Dialogue Systems via Knowledge Enhancement and Alignment
- Domain Private Transformers for Multi-Domain Dialog Systems
Discourse and Pragmatics (Poster)
Room: Virtual-Gathertown
Efficient Methods for NLP (Poster)
Room: Virtual-Gathertown
- SIR-ABSC: Incorporating Syntax into RoBERTa-based Sentiment Analysis Models with a Special Aggregator Token
- Boosting Inference Efficiency: Unleashing the Power of Parameter-Shared Pre-trained Language Models
- Image and Text: Fighting the same Battle? Super Resolution Learning for Imbalanced Text Classification
Ethics in NLP (Poster)
Room: Virtual-Gathertown
Human-Centered NLP (Poster)
Room: Virtual-Gathertown
Information Extraction (Poster)
Room: Virtual-Gathertown
- impact of sample selection on in-context learning for entity extraction from scientific writing
- Unsupervised Candidate Answer Extraction through Differentiable Masker-Reconstructor Model
- A Read-and-Select Framework for Zero-shot Entity Linking
- CoVariance-based Causal Debiasing for Entity and Relation Extraction
Information Retrieval and Text Mining (Poster)
Room: Virtual-Gathertown
Interpretability, Interactivity, and Analysis of Models for NLP (Poster)
Room: Virtual-Gathertown
Language Grounding to Vision, Robotics and Beyond (Poster)
Room: Virtual-Gathertown
Language Modeling and Analysis of Language Models (Poster)
Room: Virtual-Gathertown
Multilinguality and Linguistic Diversity (Poster)
Room: Virtual-Gathertown
NLP Applications (Poster)
Room: Virtual-Gathertown
- VISTA: Visual-Textual Knowledge Graph Representation Learning
- Causal Inference from Text: Unveiling Interactions between Variables
- Task-Aware Self-Supervised Framework for Dialogue Discourse Parsing
- StyleBART: Decorate Pretrained Model with Style Adapters for Unsupervised Stylistic Headline Generation
- Exploring the Potential of Large Language Models in Generating Code-Tracing Questions for Introductory Programming Courses
- PROTEGE: Prompt-based Diverse Question Generation from Web Articles
- SDOH-NLI: a Dataset for Inferring Social Determinants of Health from Clinical Notes
Question Answering (Poster)
Room: Virtual-Gathertown
Resources and Evaluation (Poster)
Room: Virtual-Gathertown
- On Evaluation of Bangla Word Analogies
- Dialect-to-Standard Normalization: A Large-Scale Multilingual Evaluation
- Statistically Profiling Biases in Natural Language Reasoning Datasets and Models
- Can ChatGPT Perform Reasoning Using the IRAC Method in Analyzing Legal Scenarios Like a Lawyer?
- IndiSocialFT: Multilingual Word Representation for Indian languages in code-mixed environment
- Robustness Tests for Automatic Machine Translation Metrics with Adversarial Attacks
- A Rewriting Approach for Gender Inclusivity in Portuguese
- BLM-s/lE: A structured dataset of English spray-load verb alternations for testing generalization in LLMs
Semantics: Lexical, Sentence level, Document Level, Textual Inference, etc. (Poster)
Room: Virtual-Gathertown
Sentiment Analysis, Stylistic Analysis, and Argument Mining (Poster)
Room: Virtual-Gathertown
- Aspect-Category Enhanced Learning with a Neural Coherence Model for Implicit Sentiment Analysis
- Dynamic Stance: Modeling Discussions by Labeling the Interactions
- Analysis of Style-Shifting on Social Media: Using Neural Language Model Conditioned by Social Meanings
- Misery Loves Complexity: Exploring Linguistic Complexity in the Context of Emotion Detection
Summarization (Poster)
Room: Virtual-Gathertown
Theme Track: Large Language Models and the Future of NLP (Poster)
Room: Virtual-Gathertown
- SUT: Active Defects Probing for Transcompiler Models
- NLP Evaluation in trouble: On the Need to Measure LLM Data Contamination for each Benchmark
- LDM$^2$: A Large Decision Model Imitating Human Cognition with Dynamic Memory Enhancement
- Self-Polish: Enhance Reasoning in Large Language Models via Problem Refinement
- The Past, Present, and Future of Typological Databases in NLP
- Search Augmented Instruction Learning
Poster_Demo_Industry_Findings Virtual 5
Poster Presentations
Computational Social Science and Cultural Analytics (Poster)
Room: Virtual-Gathertown
Dialogue and Interactive Systems (Poster)
Room: Virtual-Gathertown
Efficient Methods for NLP (Poster)
Room: Virtual-Gathertown
- Efficiently Enhancing Zero-Shot Performance of Instruction Following Model via Retrieval of Soft Prompt
- ASPIRO: Any-shot Structured Parsing-error-Induced ReprOmpting for Consistent Data-to-Text Generation
- INGENIOUS: Using Informative Data Subsets for Efficient Pre-Training of Language Models
- On Surgical Fine-tuning for Language Encoders
Ethics in NLP (Poster)
Room: Virtual-Gathertown
Human-Centered NLP (Poster)
Room: Virtual-Gathertown
Information Extraction (Poster)
Room: Virtual-Gathertown
Information Retrieval and Text Mining (Poster)
Room: Virtual-Gathertown
Interpretability, Interactivity, and Analysis of Models for NLP (Poster)
Room: Virtual-Gathertown
Language Grounding to Vision, Robotics and Beyond (Poster)
Room: Virtual-Gathertown
Language Modeling and Analysis of Language Models (Poster)
Room: Virtual-Gathertown
Linguistic Theories, Cognitive Modeling, and Psycholinguistics (Poster)
Room: Virtual-Gathertown
Machine Learning for NLP (Poster)
Room: Virtual-Gathertown
Multilinguality and Linguistic Diversity (Poster)
Room: Virtual-Gathertown
Natural Language Generation (Poster)
Room: Virtual-Gathertown
NLP Applications (Poster)
Room: Virtual-Gathertown
- Exploiting Contrastive Learning and Numerical Evidence for Confusing Legal Judgment Prediction
- Towards Detecting Contextual Real-Time Toxicity for In-Game Chat
- Mitigating Framing Bias with Polarity Minimization Loss
- The Truth, The Whole Truth, and Nothing but the Truth: A New Benchmark Dataset for Hebrew Text Credibility Assessment
- Are NLP Models Good at Tracing Thoughts: An Overview of Narrative Understanding
Phonology, Morphology, and Word Segmentation (Poster)
Room: Virtual-Gathertown
Question Answering (Poster)
Room: Virtual-Gathertown
Resources and Evaluation (Poster)
Room: Virtual-Gathertown
- CReTIHC: Designing Causal Reasoning Tasks about Temporal Interventions and Hallucinated Confoundings
- FREDSum: A Dialogue Summarization Corpus for French Political Debates
- The Vault: A Comprehensive Multilingual Dataset for Advancing Code Understanding and Generation
- Don't waste a single annotation: improving single-label classifiers through soft labels
- SYMPTOMIFY: Transforming Symptom Annotations with Language Model Knowledge Harvesting
Semantics: Lexical, Sentence level, Document Level, Textual Inference, etc. (Poster)
Room: Virtual-Gathertown
Sentiment Analysis, Stylistic Analysis, and Argument Mining (Poster)
Room: Virtual-Gathertown
Summarization (Poster)
Room: Virtual-Gathertown
Theme Track: Large Language Models and the Future of NLP (Poster)
Room: Virtual-Gathertown
- Blackbird language matrices (BLM), a new task for rule-like generalization in neural networks: Can Large Language Models pass the test?
- Thorny Roses: Investigating the Dual Use Dilemma in Natural Language Processing
- AutoPlan: Automatic Planning of Interactive Decision-Making Tasks With Large Language Models
- NERetrieve: Dataset for Next Generation Named Entity Recognition and Retrieval
- Sources of Hallucination by Large Language Models on Inference Tasks
Session 5
Oral Presentations
Efficient Methods for NLP 2 (Oral)
Room: West 2
- APrompt: Attention Prompt Tuning for Efficient Adaptation of Pre-trained Language Models
- Parameter-Efficient Language Model Tuning with Active Learning in Low-Resource Settings
- Merging Experts into One: Improving Computational Efficiency of Mixture of Experts
- Selective Labeling: How to Radically Lower Data-Labeling Costs for Document Extraction Models
- Focus Your Attention (with Adaptive IIR Filters)
- Let's Sample Step by Step: Adaptive-Consistency for Efficient Reasoning and Coding with LLMs
Human-Centered NLP (Oral)
Room: West 3
- Modeling Empathic Similarity in Personal Narratives
- A Diachronic Perspective on User Trust in AI under Uncertainty
- Dr ChatGPT tell me what I want to hear: How different prompts impact health answer correctness
- Pre-Trained Language Models Augmented with Synthetic Scanpaths for Natural Language Understanding
- Generating and Evaluating Tests for K-12 Students with Language Model Simulations: A Case Study on Sentence Reading Efficiency
- "Fifty Shades of Bias": Normative Ratings of Gender Bias in GPT Generated English Text
Multilinguality and Linguistic Diversity 1 (Oral)
Room: East
- BasahaCorpus: An Expanded Linguistic Resource for Readability Assessment in Central Philippine Languages
- Translating away Translationese without Parallel Data
- Automatic Transcription of Handwritten Old Occitan Language
- Don’t Trust ChatGPT when your Question is not in English: A Study of Multilingual Abilities and Types of LLMs
- Revisiting Machine Translation for Cross-lingual Classification
- Language Representation Projection: Can We Transfer Factual Knowledge across Languages in Multilingual Language Models?
Natural Language Generation 1 (Oral)
Room: Central 1
- Structure-aware Knowledge Graph-to-text Generation with Planning Selection and Similarity Distinction
- Granularity Matters: Pathological Graph-driven Cross-modal Alignment for Brain CT Report Generation
- Improving Image Captioning via Predicting Structured Concepts
- CodeFusion: A Pre-trained Diffusion Model for Code Generation
- Look-back Decoding for Open-Ended Text Generation
- Measuring Attribution in Natural Language Generation Models
NLP Applications 1 (Oral)
Room: Central 3
- Learning Co-Speech Gesture for Multimodal Aphasia Type Detection
- ClimateBERT-NetZero: Detecting and Assessing Net Zero and Reduction Targets
- Advancements in Arabic Grammatical Error Detection and Correction: An Empirical Investigation
- Detection of Multiple Mental Disorders from Social Media with Two-Stream Psychiatric Experts
- Hidding the Ghostwriters: An Adversarial Evaluation of AI-Generated Student Essay Detection
- Learning the Visualness of Text Using Large Vision-Language Models
Theme Track: Large Language Models and the Future of NLP 1 (Oral)
Room: West 1
- Fighting Fire with Fire: The Dual Role of LLMs in Crafting and Detecting Elusive Disinformation
- Reasoning with Language Model is Planning with World Model
- Stop Uploading Test Data in Plain Text: Practical Strategies for Mitigating Data Contamination by Evaluation Benchmarks
- Prompting is not a substitute for probability measurements in large language models
- FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions
- LLMLingua: Compressing Prompts for Accelerated Inference of Large Language Models
Session 6
Oral Presentations
Interpretability, Interactivity, and Analysis of Models for NLP 2 (Oral)
Room: East
- Absolute Position Embedding Learns Sinusoid-like Waves for Attention Based on Relative Position
- Statistical Depth for Ranking and Characterizing Transformer-Based Text Embeddings
- Explaining Interactions Between Text Spans
- Bridging Information-Theoretic and Geometric Compression in Language Models
- What Comes Next? Evaluating Uncertainty in Neural Text Generators Against Human Production Variability
- Data Factors for Better Compositional Generalization
Language Modeling and Analysis of Language Models 2 (Oral)
Room: Central 1
- Inverse Scaling Can Become U-Shaped
- Revisiting Instruction Fine-tuned Model Evaluation to Guide Industrial Applications
- FinGPT: Large Generative Models for a Small Language
- Consistency Analysis of ChatGPT
- How Abstract Is Linguistic Generalization in Large Language Models? Experiments with Argument Structure
- How is a “Kitchen Chair” like a “Farm Horse”? Exploring the Representation of Noun-Noun Compound Semantics in Transformer-based Language Models
Multilinguality and Linguistic Diversity 2 (Oral)
Room: Central 3
- Multilingual Large Language Models Are Not (Yet) Code-Switchers
- Cross-lingual Prompting: Improving Zero-shot Chain-of-Thought Reasoning across Languages
- Investigating Bias in Multilingual Language Models: Cross-Lingual Transfer of Debiasing Techniques
- FOCUS: Effective Embedding Initialization for Monolingual Specialization of Multilingual Models
- GPTAraEval: A Comprehensive Evaluation of ChatGPT on Arabic NLP
- Shared Lexical Items as Triggers of Code Switching
Natural Language Generation 2 (Oral)
Room: West 1
- MULTITuDE: Large-Scale Multilingual Machine-Generated Text Detection Benchmark
- Active Learning for Natural Language Generation
- Interactive Text Generation
- INSTRUCTSCORE: Towards Explainable Text Generation Evaluation with Automatic Feedback
- Pre-training Language Models for Comparative Reasoning
- Composable Text Controls in Latent Space with ODEs
Question Answering (Oral)
Room: West 2
- Compressing and Debiasing Vision-Language Pre-Trained Models for Visual Question Answering
- Merging Generated and Retrieved Knowledge for Open-Domain QA
- Diversity Enhanced Narrative Question Generation for Storybooks
- The Art of SOCRATIC QUESTIONING: Recursive Thinking with Large Language Models
- Once Upon a ${\it Time}$ in ${\it Graph}$: Relative-Time Pretraining for Complex Temporal Reasoning
- On the Robustness of Dialogue History Representation in Conversational Question Answering: A Comprehensive Study and a New Prompt-based Method
Resources and Evaluation 1 (Oral)
Room: West 3
- HaluEval: A Large-Scale Hallucination Evaluation Benchmark for Large Language Models
- TRIGO: Benchmarking Formal Mathematical Proof Reduction for Generative Language Models
- BanglaAbuseMeme: A Dataset for Bengali Abusive Meme Classification
- IDTraffickers: An Authorship Attribution Dataset to link and connect Potential Human-Trafficking Operations on Text Escort Advertisements
- This is not a Dataset: A Large Negation Benchmark to Challenge Large Language Models
- Do LLMs Understand Social Knowledge? Evaluating the Sociability of Large Language Models with SocKET Benchmark
Timezone: Conference (Singapore) UTC Browser
Poster_Demo_Industry Hybrid 6
Poster Presentations
Computational Social Science and Cultural Analytics (Poster)
Room: East Foyer(Virtual)
Industry (Poster)
Room: East Foyer(Virtual)
Information Retrieval and Text Mining (Poster)
Room: East Foyer(Virtual)
Machine Learning for NLP (Poster)
Room: East Foyer(Virtual)
- PTP: Boosting Stability and Performance of Prompt Tuning with Perturbation-Based Regularizer
- Chain-of-Thought Tuning: Masked Language Models can also Think Step By Step in Natural Language Understanding
- Zero-shot Sharpness-Aware Quantization for Pre-trained Language Models
- Practical Computational Power of Linear Transformers and Their Recurrent and Self-Referential Extensions
- Ling-CL: Understanding NLP Models through Linguistic Curricula
- PAC-tuning: Fine-tuning Pre-trained Language Models with PAC-driven Perturbed Gradient Descent
- An End-to-End Contrastive Self-Supervised Learning Framework for Language Understanding
NLP Applications (Poster)
Room: East Foyer(Virtual)
- STINMatch: Semi-Supervised Semantic-Topological Iteration Network for Financial Risk Detection via News Label Diffusion
- Clinical Contradiction Detection
- Learning Knowledge-Enhanced Contextual Language Representations for Domain Natural Language Understanding
- BioFEG: Generate Latent Features for Biomedical Entity Linking
- Hierarchical Pretraining on Multimodal Electronic Health Records
- Multi-Task Knowledge Distillation with Embedding Constraints for Scholarly Keyphrase Boundary Classification
- CodeBERTScore: Evaluating Code Generation with Pretrained Models of Code
- Joint Geometrical and Statistical Domain Adaptation for Cross-domain Code Vulnerability Detection
- ATHENA: Mathematical Reasoning with Thought Expansion
- Federated Meta-Learning for Emotion and Sentiment Aware Multi-modal Complaint Identification
Poster_Demo_Industry Hybrid 7
Poster Presentations
Demo (Poster)
Room: East Foyer(Virtual)
Industry (Poster)
Room: East Foyer(Virtual)
- AdaBERT-CTC: Leveraging BERT-CTC for Text-Only Domain Adaptation in ASR
- An Auxiliary Task Boosted Multi-task Learning Method for Service Account Retrieval with Limited Human Annotation
- Compute-Efficient Churn Reduction for Conversational Agents
- Coordinated Replay Sample Selection for Continual Federated Learning
- Does Named Entity Recognition Truly Not Scale Up to Real-world Product Attribute Extraction?
- E2E Spoken Entity Extraction for Virtual Agents
- Empower Large Language Model to Perform Better on Industrial Domain-Specific Question Answering
- Improving Contextual Query Rewrite for Conversational AI Agents through User-preference Feedback Learning
- InsightNet : Structured Insight Mining from Customer Feedback
- KD-Boost: Boosting Real-Time Semantic Matching in E-commerce with Knowledge Distillation
- Multi-teacher Distillation for Multilingual Spelling Correction
- MUST&P-SRL: Multi-lingual and Unified Syllabification in Text and Phonetic Domains for Speech Representation Learning
- On Sample-Efficient Code Generation
- Relevance-assisted Generation for Robust Zero-shot Retrieval
- Retrieve and Copy: Scaling ASR Personalization to Large Catalogs
- Too much of product information : Don't worry, let's look for evidence!
- Unveiling Identity Biases in Toxicity Detection : A Game-Focused Dataset and Reactivity Analysis Approach
- ViGPTQA - State-of-the-Art LLMs for Vietnamese Question Answering: System Overview, Core Models Training, and Evaluations
- VKIE: The Application of Key Information Extraction on Video Text
Semantics: Lexical, Sentence level, Document Level, Textual Inference, etc. (Poster)
Room: East Foyer(Virtual)
Poster_Demo_Industry_Findings In-person 6
Poster Presentations
Commonsense Reasoning (Poster)
Room: East Foyer
Computational Social Science and Cultural Analytics (Poster)
Room: East Foyer
- Vicarious Offense and Noise Audit of Offensive Speech Classifiers: Unifying Human and Machine Disagreement on What is Offensive
- Speak, Memory: An Archaeology of Books Known to ChatGPT/GPT-4
- CoSyn: Detecting Implicit Hate Speech in Online Conversations Using a Context Synergized Hyperbolic Network
- Toward a Critical Toponymy Framework for Named Entity Recognition: A Case Study of Airbnb in New York City
- Decoding the Silent Majority: Inducing Belief Augmented Social Graph with Large Language Model for Response Forecasting
- From Values to Opinions: Predicting Human Behaviors and Stances Using Value-Injected Large Language Models
- PHD: Pixel-Based Language Modeling of Historical Documents
- Sociocultural Norm Similarities and Differences via Situational Alignment and Explainable Textual Entailment
- MingOfficial: A Ming Official Career Dataset and a Historical Context-Aware Representation Learning Framework
- VIBE: Topic-Driven Temporal Adaptation for Twitter Classification
- NORMSAGE: Multi-Lingual Multi-Cultural Norm Discovery from Conversations On-the-Fly
- People Make Better Edits: Measuring the Efficacy of LLM-Generated Counterfactually Augmented Data for Harmful Language Detection
- Analysing State-Backed Propaganda Websites: a New Dataset and Linguistic Study
- Multilingual estimation of political-party positioning: From label aggregation to long-input Transformers
- Natural Language Decompositions of Implicit Content Enable Better Text Representations
- Cross-Cultural Analysis of Human Values, Morals, and Biases in Folk Tales
- A Digital Language Coherence Marker for Monitoring Dementia
- TalkUp: Paving the Way for Understanding Empowering Language
- Beneath the Surface: Unveiling Harmful Memes with Multimodal Reasoning Distilled from Large Language Models
- More than Votes? Voting and Language based Partisanship in the US Supreme Court
- C2D2 Dataset: A Resource for the Cognitive Distortion Analysis and Its Impact on Mental Health
Demo (Poster)
Room: East Foyer
- Fabricator: An Open Source Toolkit for Generating Labeled Training Data with Teacher LLMs
- End-to-End Evaluation for Low-Latency Simultaneous Speech Translation
- Gentopia.AI: A Collaborative Platform for Tool-Augmented LLMs
- SentAlign: Accurate and Scalable Sentence Alignment
- QACheck: A Demonstration System for Question-Guided Multi-Hop Fact-Checking
- Kandinsky: an Improved Text-to-Image Synthesis with Image Prior and Latent Diffusion
- NewsRecLib: A PyTorch-Lightning Library for Neural News Recommendation
- MiniChain: A Small Library for Coding with Large Language Models
Discourse and Pragmatics (Poster)
Room: East Foyer
Ethics in NLP (Poster)
Room: East Foyer
Information Extraction (Poster)
Room: East Foyer
Information Retrieval and Text Mining (Poster)
Room: East Foyer
- Query-as-context Pre-training for Dense Passage Retrieval
- CaseEncoder: A Knowledge-enhanced Pre-trained Model for Legal Case Encoding
- Modeling Conceptual Attribute Likeness and Domain Inconsistency for Metaphor Detection
- Longtriever: a Pre-trained Long Text Encoder for Dense Document Retrieval
- DREAM: Deployment of Recombination and Ensembles in Argument Mining
- Instructed Language Models with Retrievers Are Powerful Entity Linkers
- Rethinking Negative Pairs in Code Search
- Enhancing Generative Retrieval with Reinforcement Learning from Relevance Feedback
- How Does Generative Retrieval Scale to Millions of Passages?
- Expand, Highlight, Generate: RL-driven Document Generation for Passage Reranking
- mAggretriever: A Simple yet Effective Approach to Zero-Shot Multilingual Dense Retrieval
- GLEN: Generative Retrieval via Lexical Index Learning
- NAIL: Lexical Retrieval Indices with Efficient Non-Autoregressive Decoders
- Once is Enough: A Light-Weight Cross-Attention for Fast Sentence Pair Modeling
- Cross-Lingual Cross-Target Stance Detection with Dual Knowledge Distillation Framework
- ClusterLLM: Large Language Models as a Guide for Text Clustering
- Poisoning Retrieval Corpora by Injecting Adversarial Passages
- PIEClass: Weakly-Supervised Text Classification with Prompting and Noise-Robust Iterative Ensemble Training
- Enhancing the Ranking Context of Dense Retrieval through Reciprocal Nearest Neighbors
- Semantic Similarity Models for Depression Severity Estimation
- Reasoning over Public and Private Data in Retrieval-Based Systems
- Improving Multitask Retrieval by Promoting Task Specialization
Interpretability, Interactivity, and Analysis of Models for NLP (Poster)
Room: East Foyer
Language Grounding to Vision, Robotics and Beyond (Poster)
Room: East Foyer
Machine Learning for NLP (Poster)
Room: East Foyer
- FLatS: Principled Out-of-Distribution Detection with Feature-Based Likelihood Ratio Score
- Make Every Example Count: On the Stability and Utility of Self-Influence for Learning from Noisy NLP Datasets
- Empower Nested Boolean Logic via Self-Supervised Curriculum Learning
- Transductive Learning for Textual Few-Shot Classification in API-based Embedding Models
- Large-scale similarity search with Optimal Transport
- A linear time approximation of Wasserstein distance with word embedding selection
- The Distributional Hypothesis Does Not Fully Explain the Benefits of Masked Language Model Pretraining
- Out-of-Distribution Generalization in Natural Language Processing: Past, Present, and Future
- Multi-view Contrastive Learning for Entity Typing over Knowledge Graphs
- TaskWeb: Selecting Better Source Tasks for Multi-task NLP
- Mitigating Backdoor Poisoning Attacks through the Lens of Spurious Correlation
- Tree Prompting: Efficient Task Adaptation without Fine-Tuning
- FedID: Federated Interactive Distillation for Large-Scale Pretraining Language Models
- JointMatch: A Unified Approach for Diverse and Collaborative Pseudo-Labeling to Semi-Supervised Text Classification
- Efficient Grammatical Error Correction Via Multi-Task Training and Optimized Training Schedule
- GradSim: Gradient-Based Language Grouping for Effective Multilingual Training
- DNA: Denoised Neighborhood Aggregation for Fine-grained Category Discovery
- Inference-Time Policy Adapters (IPA): Tailoring Extreme-Scale LMs without Fine-tuning
- Contrastive Learning of Sentence Embeddings from Scratch
- Uncertainty Guided Global Memory Improves Multi-Hop Question Answering
- Debiasing Made State-of-the-art: Revisiting the Simple Seed-based Weak Supervision for Text Classification
- Meta-Learning Online Adaptation of Language Models
- ULF: Unsupervised Labeling Function Correction using Cross-Validation for Weak Supervision
- Using Interpretation Methods for Model Enhancement
- DisCo: Distilled Student Models Co-training for Semi-supervised Text Mining
- Relation-aware Ensemble Learning for Knowledge Graph Embedding
- Erasure of Unaligned Attributes from Neural Representations
Machine Translation (Poster)
Room: East Foyer
Natural Language Generation (Poster)
Room: East Foyer
- Simplicity Level Estimate (SLE): A Learned Reference-Less Metric for Sentence Simplification
- A Quality-based Syntactic Template Retriever for Syntactically-Controlled Paraphrase Generation
- Critic-Driven Decoding for Mitigating Hallucinations in Data-to-text Generation
- PromptMix: A Class Boundary Augmentation Method for Large Language Model Distillation
- CP-BCS: Binary Code Summarization Guided by Control Flow Graph and Pseudo Code
- HistAlign: Improving Context Dependency in Language Generation by Aligning with History
- Grammar-Constrained Decoding for Structured NLP Tasks without Finetuning
- Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model
- Harnessing Black-Box Control to Boost Commonsense in LM's Generation
- JASMINE: Arabic GPT Models for Few-Shot Learning
- Hallucination Mitigation in Natural Language Generation from Large-Scale Open-Domain Knowledge Graphs
- Expository Text Generation: Imitate, Retrieve, Paraphrase
- On the Automatic Generation and Simplification of Children's Stories
- Unifying Discrete and Continuous Representations for Unsupervised Paraphrase Generation
- Learning to Rank Generation with Pairwise Partial Rewards
NLP Applications (Poster)
Room: East Foyer
- Rethinking Model Selection and Decoding for Keyphrase Generation with Pre-trained Sequence-to-Sequence Models
- RepoCoder: Repository-Level Code Completion Through Iterative Retrieval and Generation
- NameGuess: Column Name Expansion for Tabular Data
- Temporal Knowledge Graph Forecasting Without Knowledge Using In-Context Learning
- What to Read in a Contract? Party-Specific Summarization of Legal Obligations, Entitlements, and Prohibitions
- CoAnnotating: Uncertainty-Guided Work Allocation between Human and Large Language Models for Data Annotation
- A Scalable Framework for Table of Contents Extraction from Complex ESG Annual Reports
- Unlearn What You Want to Forget: Efficient Unlearning for LLMs
- A Cheaper and Better Diffusion Language Model with Soft-Masked Noise
- GeoLM: Empowering Language Models for Geospatially Grounded Language Understanding
- Semantic matching for text classification with complex class descriptions
- Evaluating Cross-Domain Text-to-SQL Models and Benchmarks
- Enhancing Structured Evidence Extraction for Fact Verification
- System Combination via Quality Estimation for Grammatical Error Correction
- Reducing Sequence Length by Predicting Edit Spans with Large Language Models
- Beware of Model Collapse! Fast and Stable Test-time Adaptation for Robust Question Answering
- Revisiting the Knowledge Injection Frameworks
- Non-autoregressive Text Editing with Copy-aware Latent Alignments
- Unsupervised Grammatical Error Correction Rivaling Supervised Methods
- FAME: Flexible, Scalable Analogy Mappings Engine
- CoCo: Coherence-Enhanced Machine-Generated Text Detection Under Low Resource With Contrastive Learning
- Not all Fake News is Written: A Dataset and Analysis of Misleading Video Headlines
- Automated Fact-Checking in Dialogue: Are Specialized Models Needed?
- Exploring Distributional Shifts in Large Language Models for Code Analysis
- V.\tIntroduction to Mathematical Language Processing: Informal Proofs, Word Problems, and Supporting Tasks
- Pre-train, Prompt and Recommendation: A Comprehensive Survey of Language Modelling Paradigm Adaptations in Recommender Systems
- Data Augmentation for Code Translation with Comparable Corpora and Multiple References
- Evaluating and Enhancing the Robustness of Code Pre-trained Models through Structure-Aware Adversarial Samples Generation
- Aligning Language Models to User Opinions
Question Answering (Poster)
Room: East Foyer
Speech and Multimodality (Poster)
Room: East Foyer
Theme Track: Large Language Models and the Future of NLP (Poster)
Room: East Foyer
Poster_Demo_Industry_Findings In-person 7
Poster Presentations
Computational Social Science and Cultural Analytics (Poster)
Room: East Foyer
- The PEACE-Reviews dataset: Modeling Cognitive Appraisals in Emotion Text Analysis
- Modeling Highlighting of Metaphors in Multitask Contrastive Learning Paradigms
- Beyond Denouncing Hate: Strategies for Countering Implied Biases and Stereotypes in Language
- Dimensions of Online Conflict: Towards Modeling Agonism
Demo (Poster)
Room: East Foyer
- Reaction Miner: An Integrated System for Chemical Reaction Extraction from Textual Data
- Prompt2Model: Generating Deployable Models from Natural Language Instructions
- NewsSense: Reference-free Verification via Cross-document Comparison
- CLEVA: Chinese Language Models EVAluation Platform
- CocoSciSum: A Scientific Summarization Toolkit with Compositional Controllability
Dialogue and Interactive Systems (Poster)
Room: East Foyer
- Enhancing Task-oriented Dialogue Systems with Generative Post-processing Networks
- Large Language Models Meet Harry Potter: A Dataset for Aligning Dialogue Agents with Characters
- Logic Unveils Truth, While Disguise Obscures It: Transition Logic Augmented Response Selection for Multi-Turn Dialogue
- Aligning Predictive Uncertainty with Clarification Questions in Grounded Dialog
- Long-Horizon Dialogue Understanding for Role Identification in the Game of Avalon with Large Language Models
- FFAEval: Evaluating Dialogue System via Free-For-All Ranking
- PCMID: Multi-Intent Detection through Supervised Prototypical Contrastive Learning
- xDial-Eval: A Multilingual Open-Domain Dialogue Evaluation Benchmark
- RefGPT: Dialogue Generation of GPT, by GPT, and for GPT
Efficient Methods for NLP (Poster)
Room: East Foyer
- NASH: A Simple Unified Framework of Structured Pruning for Accelerating Encoder-Decoder Language Models
- Survival of the Most Influential Prompts: Efficient Black-Box Prompt Search via Clustering and Pruning
- Beyond Layout Embedding: Layout Attention with Gaussian Biases for Structured Document Understanding
- EffEval: A Comprehensive Evaluation of Efficiency for MT Evaluation Metrics
- MEAL: Stable and Active Learning for Few-Shot Prompting
Ethics in NLP (Poster)
Room: East Foyer
Human-Centered NLP (Poster)
Room: East Foyer
Industry (Poster)
Room: East Foyer
- A Comparative Analysis of Task-Agnostic Distillation Methods for Compressing Transformer Language Models
- AART: AI-Assisted Red-Teaming with Diverse Data Generation for New LLM-powered Applications
- AdapterDistillation: Non-Destructive Task Composition with Knowledge Distillation
- Adaptive Hyper-parameter Learning for Deep Semantic Retrieval
- Are ChatGPT and GPT-4 General-Purpose Solvers for Financial Text Analytics? A Study on Several Typical Tasks
- Batch Prompting: Efficient Inference with Large Language Model APIs
- BeautifulPrompt: Towards Automatic Prompt Engineering for Text-to-Image Synthesis
- CarExpert: Leveraging Large Language Models for In-Car Conversational Question Answering
- Conversing with databases: Practical Natural Language Querying
- Creator Context for Tweet Recommendation
- Deep Metric Learning to Hierarchically Rank - An Application in Product Retrieval
- DELPHI: Data for Evaluating LLMs' Performance in Handling Controversial Issues
- DocumentNet: Bridging the Data Gap in Document Pre-training
- DUBLIN: Visual Document Understanding By Language-Image Network
- EELBERT: Tiny Models through Dynamic Embeddings
- Enhancing Extreme Multi-Label Text Classification: Addressing Challenges in Model, Data, and Evaluation
- Enhancing Language Model with Unit Test Techniques for Efficient Regular Expression Generation
- Gatekeeper to save COGS and improve efficiency of Text Prediction
- Generative Models for Product Attribute Extraction
- Gold Standard Bangla OCR Dataset: An In-Depth Look at Data Preprocessing and Annotation Processes
- Graph Meets LLM: A Novel Approach to Collaborative Filtering for Robust Conversational Understanding
- Harnessing LLMs for Temporal Data - A Study on Explainable Financial Time Series Forecasting
- InstructPTS: Instruction-Tuning LLMs for Product Title Summarization
- Investigating Table-to-Text Generation Capabilities of Large Language Models in Real-World Information Seeking Scenarios
- JarviX: A LLM No code Platform for Tabular Data Analysis and Optimization
- Joint Dialogue Topic Segmentation and Categorization: A Case Study on Clinical Spoken Conversations
- Learning Multilingual Sentence Representations with Cross-lingual Consistency Regularization
- Multi-word Tokenization for Sequence Compression
- ORANGE: Text-video Retrieval via Watch-time-aware Heterogeneous Graph Contrastive Learning
- Personalized Dense Retrieval on Global Index for Voice-enabled Conversational Systems
- Query-aware Multi-modal based Ranking Relevance in Video Search
- Retrieval-Enhanced Dual Encoder Training for Product Matching
- SAMP: A Model Inference Toolkit of Post-Training Quantization for Text Processing via Self-Adaptive Mixed-Precision
- Scaling Neural ITN for Numbers and Temporal Expressions in Tamil: Findings for an Agglutinative Low-resource Language
- Speakerly: A Voice-based Writing Assistant for Text Composition
- STEER: Semantic Turn Extension-Expansion Recognition for Voice Assistants
- Text2Topic: Multi-Label Text Classification System for Efficient Topic Detection in User Generated Content with Zero-Shot Capabilities
- TMID: A Comprehensive Real-world Dataset for Trademark Infringement Detection in E-Commerce
- Towards Effective Automatic Debt Collection with Persona Awareness
- Welcome to the Real World: Efficient, Incremental and Scalable Key Point Analysis
- WordArt Designer: User-Driven Artistic Typography Synthesis using Large Language Models
Information Extraction (Poster)
Room: East Foyer
- GSAP-NER: A Novel Task, Corpus, and Baseline for Scholarly Entity Extraction Focused on Machine Learning Models and Datasets
- Prompting ChatGPT in MNER: Enhanced Multimodal Named Entity Recognition with Auxiliary Refined Knowledge
- Structure and Label Constrained Data Augmentation for Cross-domain Few-shot NER
Information Retrieval and Text Mining (Poster)
Room: East Foyer
- MEGClass: Extremely Weakly Supervised Text Classification via Mutually-Enhancing Text Granularities
- Connecting the Dots: What Graph-Based Text Representations Work Best for Text Classification using Graph Neural Networks?
- Large Language Models Know Your Contextual Search Intent: A Prompting Framework for Conversational Search
- Topic-DPR: Topic-based Prompts for Dense Passage Retrieval
Interpretability, Interactivity, and Analysis of Models for NLP (Poster)
Room: East Foyer
Language Grounding to Vision, Robotics and Beyond (Poster)
Room: East Foyer
Language Modeling and Analysis of Language Models (Poster)
Room: East Foyer
- Logic-LM: Empowering Large Language Models with Symbolic Solvers for Faithful Logical Reasoning
- Are Language Models Worse than Humans at Following Prompts? It's Complicated
- TRAMS: Training-free Memory Selection for Long-range Language Modeling
- A New Benchmark and Reverse Validation Method for Passage-level Hallucination Detection
- Impact of Co-occurrence on Factual Knowledge of Large Language Models
- Locally Differentially Private Document Generation Using Zero Shot Prompting
Linguistic Theories, Cognitive Modeling, and Psycholinguistics (Poster)
Room: East Foyer
Machine Learning for NLP (Poster)
Room: East Foyer
Machine Translation (Poster)
Room: East Foyer
Multilinguality and Linguistic Diversity (Poster)
Room: East Foyer
- Improving Cross-lingual Transfer through Subtree-aware Word Reordering
- Crosslingual Transfer Learning for Low-Resource Languages Based on Multilingual Colexification Graphs
- PAXQA: Generating Cross-lingual Question Answering Examples at Training Scale
- Data Augmentation Techniques for Machine Translation of Code-Switched Texts: A Comparative Study
Natural Language Generation (Poster)
Room: East Foyer
- GTA: Gated Toxicity Avoidance for LM Performance Preservation
- Explain-then-translate: an analysis on improving program translation with self-generated explanations
- MacLaSa: Multi-Aspect Controllable Text Generation via Efficient Sampling from Compact Latent Space
- A Table-to-Text Framework with Heterogeneous Multidominance Attention and Self-Evaluated Multi-Pass Deliberation
NLP Applications (Poster)
Room: East Foyer
- $\textbf{\emph{CLMSM}}$: A Multi-Task Learning Framework for Pre-training on Procedural Text
- Universal Domain Adaptation for Robust Handling of Distributional Shifts in NLP
- Cache me if you Can: an Online Cost-aware Teacher-Student framework to Reduce the Calls to Large Language Models
- Robustness of Named-Entity Replacements for In-Context Learning
- GPT Deciphering Fedspeak: Quantifying Dissent Among Hawks and Doves
- Improving Seq2Seq Grammatical Error Correction via Decoding Interventions
- ExplainCPE: A Free-text Explanation Benchmark of Chinese Pharmacist Examination
- VERVE: Template-based ReflectiVE Rewriting for MotiVational IntErviewing
- ChatCoT: Tool-Augmented Chain-of-Thought Reasoning on Chat-based Large Language Models
- SQLPrompt: In-Context Text-to-SQL with Minimal Labeled Data
Question Answering (Poster)
Room: East Foyer
Resources and Evaluation (Poster)
Room: East Foyer
- Large Language Models are biased to overestimate profoundness
- HANSEN: Human and AI Spoken Text Benchmark for Authorship Analysis
- DeltaScore: Fine-Grained Story Evaluation with Perturbations
- ClozEx: A Task toward Generation of English Cloze Explanation
- Large Language Models are Not Yet Human-Level Evaluators for Abstractive Summarization
- Frequency Balanced Datasets Lead to Better Language Models
- DiFair: A Benchmark for Disentangled Assessment of Gender Knowledge and Bias
- Beneath Surface Similarity: Large Language Models Make Reasonable Scientific Analogies after Structure Abduction
- A Novel Contrastive Learning Method for Clickbait Detection on RoCliCo: A Romanian Clickbait Corpus of News Articles
- Aksharantar: Open Indic-language Transliteration datasets and models for the Next Billion Users
Semantics: Lexical, Sentence level, Document Level, Textual Inference, etc. (Poster)
Room: East Foyer
Sentiment Analysis, Stylistic Analysis, and Argument Mining (Poster)
Room: East Foyer
Speech and Multimodality (Poster)
Room: East Foyer
- Automatic Pronunciation Assessment - A Review
- Multi-Modal Knowledge Graph Transformer Framework for Multi-Modal Entity Alignment
- Intuitive Multilingual Audio-Visual Speech Recognition with a Single-Trained Model
- Long-Form Speech Translation through Segmentation with Finite-State Decoding Constraints on Large Language Models
Summarization (Poster)
Room: East Foyer
Syntax, Parsing and their Applications (Poster)
Room: East Foyer
Theme Track: Large Language Models and the Future of NLP (Poster)
Room: East Foyer
- Quantifying the Dialect Gap and its Correlates Across Languages
- Pragmatics in Language Grounding: Phenomena, Tasks, and Modeling Approaches
- Drilling Down into the Discourse Structure with LLMs for Long Document Question Answering
- Ethical Reasoning over Moral Alignment: A Case and Framework for In-Context Ethical Policies in LLMs
- R$^3$ Prompting: Review, Rephrase and Resolve for Chain-of-Thought Reasoning in Large Language Models under Noisy Context
- Large Language Models Are Better Adversaries: Exploring Generative Clean-Label Backdoor Attacks Against Text Classifiers
- Conversational Recommender System and Large Language Model Are Made for Each Other in E-commerce Pre-sales Dialogue
- MenatQA: A New Dataset for Testing the Temporal Comprehension and Reasoning Abilities of Large Language Models
- NarrativeXL: a Large-scale Dataset for Long-Term Memory Models
- Beyond Good Intentions: Reporting the Research Landscape of NLP for Social Good
Poster_Demo_Industry_Findings Virtual 6
Poster Presentations
Computational Social Science and Cultural Analytics (Poster)
Room: Virtual-Gathertown
Dialogue and Interactive Systems (Poster)
Room: Virtual-Gathertown
- Social Commonsense-Guided Search Query Generation for Open-Domain Knowledge-Powered Conversations
- Enhancing Conversational Search: Large Language Model-Aided Informative Query Rewriting
- RSVP: Customer Intent Detection via Agent Response Contrastive and Generative Pre-Training
- Hierarchical Prompting Assists Large Language Model on Web Navigation
- Time-Considerable Dialogue Models via Reranking by Time Dependency
Efficient Methods for NLP (Poster)
Room: Virtual-Gathertown
Information Extraction (Poster)
Room: Virtual-Gathertown
Information Retrieval and Text Mining (Poster)
Room: Virtual-Gathertown
Language Grounding to Vision, Robotics and Beyond (Poster)
Room: Virtual-Gathertown
Language Modeling and Analysis of Language Models (Poster)
Room: Virtual-Gathertown
Machine Learning for NLP (Poster)
Room: Virtual-Gathertown
Machine Translation (Poster)
Room: Virtual-Gathertown
Multilinguality and Linguistic Diversity (Poster)
Room: Virtual-Gathertown
- Unleashing the Multilingual Encoder Potential: Boosting Zero-Shot Performance via Probability Calibration
- In What Languages are Generative Language Models the Most Formal? Analyzing Formality Distribution across Languages
- Interpreting Indirect Answers to Yes-No Questions in Multiple Languages
- Evaluating Emotion Arcs Across Languages: Bridging the Global Divide in Sentiment Analysis
- BERTwich: Extending BERT’s Capabilities to Model Dialectal and Noisy Text
Natural Language Generation (Poster)
Room: Virtual-Gathertown
NLP Applications (Poster)
Room: Virtual-Gathertown
- Focus on the Core: Efficient Attention via Pruned Token Compression for Document Classification
- Can Foundation Models Watch, Talk and Guide You Step by Step to Make a Cake?
- CLASS: A Design Framework for Building Intelligent Tutoring Systems Based on Learning Science principles
- BotPercent: Estimating Bot Populations in Twitter Communities
Question Answering (Poster)
Room: Virtual-Gathertown
Resources and Evaluation (Poster)
Room: Virtual-Gathertown
- HeQ: a Large and Diverse Hebrew Reading Comprehension Benchmark
- NEWTON: Are Large Language Models Capable of Physical Reasoning?
- GPT-4 as an Effective Zero-Shot Evaluator for Scientific Figure Captions
- Conic10K: A Challenging Math Problem Understanding and Reasoning Dataset
- A Parallel Corpus for Vietnamese Central-Northern Dialect Text Transfer
Semantics: Lexical, Sentence level, Document Level, Textual Inference, etc. (Poster)
Room: Virtual-Gathertown
Speech and Multimodality (Poster)
Room: Virtual-Gathertown
- InvGC: Robust Cross-Modal Retrieval by Inverse Graph Convolution
- PersonaLM: Language Model Personalization via Domain-distributed Span Aggregated K-Nearest N-gram Retrieval Augmentation
- Handshape-Aware Sign Language Recognition: Extended Datasets and Exploration of Handshape-Inclusive Methods
- Video-Text Retrieval by Supervised Sparse Multi-Grained Learning
- Sound of Story: Multi-modal Storytelling with Audio
Summarization (Poster)
Room: Virtual-Gathertown
Theme Track: Large Language Models and the Future of NLP (Poster)
Room: Virtual-Gathertown
- Estimating Large Language Model Capabilities without Labeled Test Data
- The Less the Merrier? Investigating Language Representation in Multilingual Models
- You Are An Expert Linguistic Annotator: Limits of LLMs as Analyzers of Abstract Meaning Representation
- A Closer Look into Using Large Language Models for Automatic Evaluation
- RobustEmbed: Robust Sentence Embeddings Using Self-Supervised Contrastive Pre-Training
Poster_Demo_Industry_Findings Virtual 7
Poster Presentations
Commonsense Reasoning (Poster)
Room: Virtual-Gathertown
Computational Social Science and Cultural Analytics (Poster)
Room: Virtual-Gathertown
Dialogue and Interactive Systems (Poster)
Room: Virtual-Gathertown
Discourse and Pragmatics (Poster)
Room: Virtual-Gathertown
Efficient Methods for NLP (Poster)
Room: Virtual-Gathertown
- Learning to love diligent trolls: Accounting for rater effects in the dialogue safety task
- Pit One Against Many: Leveraging Attention-head Embeddings for Parameter-efficient Multi-head Attention
- FaLA: Fast Linear Adaptation for Replacing Backbone Models on Edge Devices
- Data Pruning for Efficient Model Pruning in Neural Machine Translation
- Efficient Cross-Task Prompt Tuning for Few-Shot Conversational Emotion Recognition
Ethics in NLP (Poster)
Room: Virtual-Gathertown
Information Extraction (Poster)
Room: Virtual-Gathertown
Information Retrieval and Text Mining (Poster)
Room: Virtual-Gathertown
Interpretability, Interactivity, and Analysis of Models for NLP (Poster)
Room: Virtual-Gathertown
Language Modeling and Analysis of Language Models (Poster)
Room: Virtual-Gathertown
Linguistic Theories, Cognitive Modeling, and Psycholinguistics (Poster)
Room: Virtual-Gathertown
Machine Learning for NLP (Poster)
Room: Virtual-Gathertown
Machine Translation (Poster)
Room: Virtual-Gathertown
Multilinguality and Linguistic Diversity (Poster)
Room: Virtual-Gathertown
Natural Language Generation (Poster)
Room: Virtual-Gathertown
NLP Applications (Poster)
Room: Virtual-Gathertown
- Can you Summarize my learnings? Towards Perspective-based Educational Dialogue Summarization
- A Frustratingly Easy Plug-and-Play Detection-and-Reasoning Module for Chinese Spelling Check
- KG-GPT: A General Framework for Reasoning on Knowledge Graphs Using Large Language Models
- A Comprehensive Evaluation of Large Language Models on Legal Judgment Prediction
- The Law and NLP: Bridging Disciplinary Disconnects
- Eyes Show the Way: Modelling Gaze Behaviour for Hallucination Detection
- Re-Temp: Relation-Aware Temporal Representation Learning for Temporal Knowledge Graph Completion
Question Answering (Poster)
Room: Virtual-Gathertown
Resources and Evaluation (Poster)
Room: Virtual-Gathertown
- INVITE: a Testbed of Automatically Generated Invalid Questions to Evaluate Large Language Models for Hallucinations
- EMO-KNOW: A Large Scale Dataset on Emotion-Cause
- AniEE: A Dataset of Animal Experimental Literature for Event Extraction
- Unlocking the Heterogeneous Landscape of Big Data NLP with DUUI
- CR-COPEC: Causal Rationale of Corporate Performance Changes to learn from Financial Reports
- BERT Goes Off-Topic: Investigating the Domain Transfer Challenge using Genre Classification
Semantics: Lexical, Sentence level, Document Level, Textual Inference, etc. (Poster)
Room: Virtual-Gathertown
Sentiment Analysis, Stylistic Analysis, and Argument Mining (Poster)
Room: Virtual-Gathertown
Speech and Multimodality (Poster)
Room: Virtual-Gathertown
Syntax, Parsing and their Applications (Poster)
Room: Virtual-Gathertown
Theme Track: Large Language Models and the Future of NLP (Poster)
Room: Virtual-Gathertown
- Pseudointelligence: A Unifying Lens on Language Model Evaluation
- ToxicChat: Unveiling Hidden Challenges of Toxicity Detection in Real-World User-AI Conversation
- Knowledge Corpus Error in Question Answering
- LLMDet: A Third Party Large Language Models Generated Text Detection Tool
- TSTR: Target Similarity Tuning Meets the Real World
- Generative Calibration for In-context Learning
Session 10
Oral Presentations
Industry track (Oral)
Room: West 3
- Self-Criticism: Aligning Large Language Models with their Understanding of Helpfulness, Honesty, and Harmlessness
- PILLOW: Enhancing Efficient Instruction Fine-tuning via Prompt Matching
- Lattice Path Edit Distance: A Romanization-aware Edit Distance for Extracting Misspelling-Correction Pairs from Japanese Search Query Logs
- LLM4Vis: Explainable Visualization Recommendation using ChatGPT
- A Pretrained Language Model for Cyber Threat Intelligence
- Investigating the Role and Impact of Disfluency on Summarization
NLP Applications 2 (Oral)
Room: East
- UniMath: A Foundational and Multimodal Mathematical Reasoner
- Predictive Chemistry Augmented with Text Retrieval
- Precedent-Enhanced Legal Judgment Prediction with LLM and Domain-Model Collaboration
- Event-Location Tracking in Narratives: A Case Study on Holocaust Testimonies
- Analyzing Norm Violations in Live-Stream Chat
- ALCAP: Alignment-Augmented Music Captioner
Resources and Evaluation 2 (Oral)
Room: Central 1
- Unveiling the Essence of Poetry: Introducing a Comprehensive Dataset and Benchmark for Poem Summarization
- You Told Me That Joke Twice: A Systematic Investigation of Transferability and Robustness of Humor Detection Models
- It Ain't Over: A Multi-aspect Diverse Math Word Problem Dataset
- Syllogistic Reasoning for Legal Judgment Analysis
- TempTabQA: Temporal Question Answering for Semi-Structured Tables
- Multilingual Previously Fact-Checked Claim Retrieval
Semantics 2 (Oral)
Room: Central 3
- On Bilingual Lexicon Induction with Large Language Models
- Random Entity Quantization for Parameter-Efficient Compositional Knowledge Graph Representation
- Systematic word meta-sense extension
- Pragmatic Reasoning Unlocks Quantifier Semantics for Foundation Models
- Non-Programmers Can Label Programs Indirectly via Active Examples: A Case Study with Text-to-SQL
- Connecting degree and polarity: An artificial language learning study
Speech & Multimodality 2 (Oral)
Room: West 1
- Rethinking and Improving Multi-task Learning for End-to-end Speech Translation
- Unsupervised Sounding Pixel Learning
- Homophone Disambiguation Reveals Patterns of Context Mixing in Speech Transformers
- Whispering LLaMA: A Cross-Modal Generative Error Correction Framework for Speech Recognition
- Conversation Understanding using Relational Temporal Graph Neural Networks with Auxiliary Cross-Modality Interaction
- Visual Spatial Reasoning
Theme Track: Large Language Models and the Future of NLP 2 (Oral)
Room: West 2
- Lion: Adversarial Distillation of Proprietary Large Language Models
- EpiK-Eval: Evaluation for Language Models as Epistemic Models
- To Build Our Future, We Must Know Our Past: Contextualizing Paradigm Shifts in Natural Language Processing
- Large Language Models: The Need for Nuance in Current Debates and a Pragmatic Perspective on Understanding
- FreeAL: Towards Human-Free Active Learning in the Era of Large Language Models
- Large Language Models Only Pass Primary School Exams in Indonesia: A Comprehensive Test on IndoMMLU
Session 9
Oral Presentations
Machine Learning for NLP (Oral)
Room: West 2
- Explicit Planning Helps Language Models in Logical Reasoning
- Where to start? Analyzing the potential value of intermediate models
- Fair Text Classification with Wasserstein Independence
- Improving Bias Mitigation through Bias Experts in Natural Language Understanding
- DSI++: Updating Transformer Memory with New Documents
- Translate-and-Test Transfer Learning for Cross-Lingual Text Classification
Semantics 1 (Oral)
Room: East
- WiCE: Real-World Entailment for Claims in Wikipedia
- Understanding Computational Models of Semantic Change: New Insights from the Speech Community
- What do Deck Chairs and Sun Hats Have in Common? Uncovering Shared Properties in Large Concept Vocabularies
- AdaSent: Efficient Domain-Adapted Sentence Embeddings for Few-Shot Classification
- Ditto: A Simple and Efficient Approach to Improve Sentence Embeddings
- Improving Language Models’ Meaning Understanding and Consistency by Learning Conceptual Roles from Dictionary
Sentiment or Stylistic Analysis (Oral)
Room: Central 1
- Argument-based Detection and Classification of Fallacies in Political Debates
- Why Should This Article Be Deleted? Transparent Stance Detection in Multilingual Wikipedia Editor Discussions
- Identification of Multimodal Stance Towards Frames of Communication
- EXPLAIN, EDIT, GENERATE: Rationale-Sensitive Counterfactual Data Augmentation for Multi-hop Fact Verification
- Joyful: Joint Modality Fusion and Graph Contrastive Learning for Multimodal Emotion Recognition
- Can Authorship Representation Learning Capture Stylistic Features?
Speech & Multimodality 1 (Oral)
Room: Central 3
- A Video Is Worth 4096 Tokens: Verbalize Story Videos To Understand Them In Zero Shot
- Balance Act: Mitigating Hubness in Cross-Modal Retrieval with Query and Gallery Banks
- Three Stream Based Multi-level Event Contrastive Learning for Text-Video Event Extraction
- Reading Order Matters: Information Extraction from Visually-rich Documents by Token Path Prediction
- MultiTurnCleanup: A Benchmark for Multi-Turn Spoken Conversational Transcript Cleanup
- Large Language Models and Multimodal Retrieval for Visual Word Sense Disambiguation
Summarization (Oral)
Room: West 1
- Instructive Dialogue Summarization with Query Aggregations
- Investigating Efficiently Extending Transformers for Long Input Summarization
- Zero-shot Faithfulness Evaluation for Text Summarization with Foundation Language Model
- Indicative Summarization of Long Discussions
- Promoting Topic Coherence and Inter-Document Consorts in Multi-Document Summarization via Simplicial Complex and Sheaf Graph
- Length Does Matter: Summary Length can Bias Summarization Metrics
Syntax, Parsing and their Applications (Oral)
Room: West 3
- Order-Theoretic Structured Prediction: Partially Ordering Tokens within a String
- 4 and 7-bit Labeling for Projective and Non-Projective Dependency Trees
- Syntactic Substitutability as Unsupervised Dependency Syntax
- Structural generalization in COGS: Supertagging is (almost) all you need
- CoRec: An Easy Approach for Coordination Recognition
- LLM-enhanced Self-training for Cross-domain Constituency Parsing