Full List of Publications and Manuscripts

2024

HYDRA: Model Factorization Framework for Black-Box LLM Personalization
Aligning Large Language Models with Representation Editing: A Control Perspective
BBox-Adapter: Lightweight Adapting for Black-Box Large Language Models
Two Birds with One Stone: Enhancing Uncertainty Quantification and Interpretability with Graph Functional Neural Process
ToolChain*: Efficient Action Space Navigation in Large Language Models with A* Search
RAM-EHR: Retrieval Augmentation Meets Clinical Predictions on Electronic Health Records
POLYIE: A Dataset of Information Extraction from Polymer Material Scientific Literature
HiGen: Hierarchy-Aware Sequence Generation for Hierarchical Text Classification
TPD: Enhancing Student Language Model Reasoning via Principle Discovery and Guidance
EHRAgent: Code Empowers Large Language Models for Complex Tabular Reasoning on Electronic Health Records
MedAdapter: Efficient Test-Time Adaptation of Large Language Models towards Medical Reasoning
BMRetriever: Tuning Large Language Models as Better Biomedical Text Retrievers
Efficient Evolutionary Search over Chemical Space with Large Language Models

2023

How Many Validation Labels Do You Need? Exploring the Design Space of Label-Efficient Model Ranking
Knowledge-Infused Prompting Improves Clinical Text Generation with Large Language Models
Retrieval-Augmented Large Language Models for Adolescent Idiopathic Scoliosis Patients in Shared Decision-Making
DF2: Distribution-Free Decision-Focused Learning
AdaPlanner: Adaptive Planning from Feedback with Language Models
ToolQA: A Dataset for LLM Question Answering with External Tools
Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias
G-STO: Sequential Main Shopping Intention Detection via Graph-Regularized Stochastic Transformer
Autoregressive Diffusion Model for Graph Generation
MUBen: Benchmarking the Uncertainty of Pre-Trained Models for Molecular Property Prediction
DyGen: Learning from Noisy Labels via Dynamics-Enhanced Generative Modeling
ReGen: Zero-Shot Text Classification via Training Data Generation with Progressive Dense Retrieval
End-to-End Stochastic Optimization with Energy-Based Model

2022

ReSel: N-ary Relation Extraction from Scientific Text and Tables by Learning to Retrieve and Select

2020

Calibrated language model fine-tuning for in-and out-of-distribution data
EXAM: An Explainable Attention-based Model for COVID-19 Automatic Diagnosis