By ayoub on Sept. 18, 2024
Research Papers List
Seminal Papers / Need-to-know
Computer Vision
2010
Noise-contrastive Estimation: a New Estimation Principle for Unnormalized Statistical Models
2012
ImageNet Classification with Deep Convolutional Neural Networks
3D Convolutional Neural Networks for Human Action Recognition
2013
Visualizing and Understanding Convolutional Networks
Learning Factored Representations in a Deep Mixture of Experts
2014
Generative Adversarial Networks
2015
Very Deep Convolutional Networks for Large-Scale Image Recognition
Going Deeper with Convolutions
FaceNet: a Unified Embedding for Face Recognition and Clustering
Distilling the Knowledge in a Neural Network
Deep Unsupervised Learning Using Nonequilibrium Thermodynamics
2016
Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks
Rethinking the Inception Architecture for Computer Vision
Deep Residual Learning for Image Recognition
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
You Only Look Once: Unified, Real-Time Object Detection
2017
Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning
Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network
Understanding Intermediate Layers Using Linear Classifier Probes
Image-to-Image Translation with Conditional Adversarial Networks
Improved Image Captioning Via Policy Gradient Optimization of SPIDEr
2018
From Recognition to Cognition: Visual Commonsense Reasoning
Focal Loss for Dense Object Detection
Relational Inductive Biases, Deep Learning, and Graph Networks
Squeeze-and-Excitation Networks
When Does Label Smoothing Help?
Unsupervised Feature Learning Via Non-Parametric Instance Discrimination
2019
Objects As Points
RandAugment: Practical Automated Data Augmentation with a Reduced Search Space
Semantic Image Synthesis with Spatially-Adaptive Normalization
Generative Modeling by Estimating Gradients of the Data Distribution
2020
Denoising Diffusion Probabilistic Models
Designing Network Design Spaces
Training Data-efficient Image Transformers & Distillation Through Attention
NeRF: Representing Scenes As Neural Radiance Fields for View Synthesis
Bootstrap Your Own Latent: a New Approach to Self-supervised Learning
A Simple Framework for Contrastive Learning of Visual Representations
Conditional Negative Sampling for Contrastive Learning of Visual Representations
Momentum Contrast for Unsupervised Visual Representation Learning
Generative Pretraining from Pixels
Random Erasing Data Augmentation
2021
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
RepVGG: Making VGG-style ConvNets Great Again
ArcFace: Additive Angular Margin Loss for Deep Face Recognition
Do Vision Transformers See Like Convolutional Neural Networks?
BEiT: BERT Pre-Training of Image Transformers
Swin Transformer: Hierarchical Vision Transformer Using Shifted Windows
CvT: Introducing Convolutions to Vision Transformers
An Empirical Study of Training Self-Supervised Vision Transformers
Diffusion Models Beat GANs on Image Synthesis
GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models
Multiscale Vision Transformers
Score-Based Generative Modeling Through Stochastic Differential Equations
Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers
Scaling Vision with Sparse Mixture of Experts
MLP-Mixer: an All-MLP Architecture for Vision
2022
A ConvNet for the 2020s
Natural Language Descriptions of Deep Visual Features
Vision Models are More Robust and Fair When Pretrained on Uncurated Images Without Supervision
Block-NeRF: Scalable Large Scene Neural View Synthesis
VICReg: Variance-Invariance-Covariance Regularization for Self-Supervised Learning
Masked Autoencoders are Scalable Vision Learners
The Effects of Regularization and Data Augmentation are Class Dependent
Instant Neural Graphics Primitives with a Multiresolution Hash Encoding
Pix2seq: a Language Modeling Framework for Object Detection
An Improved One Millisecond Mobile Backbone
Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding
Swin Transformer V2: Scaling up Capacity and Resolution
Scaling Autoregressive Models for Content-Rich Text-to-Image Generation
Sequencer: Deep LSTM for Image Classification
High-Resolution Image Synthesis with Latent Diffusion Models
Make-A-Video: Text-to-Video Generation Without Text-Video Data
Denoising Diffusion Implicit Models
CSWin Transformer: a General Vision Transformer Backbone with Cross-Shaped Windows
MViTv2: Improved Multiscale Vision Transformers for Classification and Detection
iBOT: Image BERT Pre-Training with Online Tokenizer
Imagen Video: High Definition Video Generation with Diffusion Models
2023
Hiera: a Hierarchical Vision Transformer Without the Bells-and-Whistles
Tree-Ring Watermarks: Fingerprints for Diffusion Images That are Invisible and Robust
From Sparse to Soft Mixtures of Experts
Estimating Example Difficulty Using Variance of Gradients
EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything
Initializing Models with Larger Ones
Rethinking FID: Towards a Better Evaluation Metric for Image Generation
Patch N’ Pack: NaViT, a Vision Transformer for Any Aspect Ratio and Resolution
NLP
1997
Long Short-Term Memory
2003
A Neural Probabilistic Language Model
2004
ROUGE: a Package for Automatic Evaluation of Summaries
2005
METEOR: an Automatic Metric for MT Evaluation with Improved Correlation with Human Judgments
2010
Recurrent Neural Network Based Language Model
2011
Generating Text with Recurrent Neural Networks
2013
Efficient Estimation of Word Representations in Vector Space
Distributed Representations of Words and Phrases and Their Compositionality
2014
On the Properties of Neural Machine Translation: Encoder–Decoder Approaches
GloVe: Global Vectors for Word Representation
Sequence to Sequence Learning with Neural Networks
Learning Phrase Representations Using RNN Encoder–Decoder for Statistical Machine Translation
2015
Neural Machine Translation by Jointly Learning to Align and Translate
Effective Approaches to Attention-based Neural Machine Translation
Skip-Thought Vectors
2016
Google’s Neural Machine Translation System: Bridging the Gap Between Human and Machine Translation
Neural Machine Translation of Rare Words with Subword Units
HyperNetworks
2017
Attention is All You Need
Outrageously Large Neural Networks: the Sparsely-Gated Mixture-of-Experts Layer
Using the Output Embedding to Improve Language Models
Enriching Word Vectors with Subword Information
2018
Deep Contextualized Word Representations
Improving Language Understanding by Generative Pre-Training
SentencePiece: a Simple and Language Independent Subword Tokenizer and Detokenizer for Neural Text Processing
Self-Attention with Relative Position Representations
Blockwise Parallel Decoding for Deep Autoregressive Models
Universal Language Model Fine-tuning for Text Classification
Diverse Beam Search: Decoding Diverse Solutions from Neural Sequence Models
MS MARCO: a Human Generated MAchine Reading COmprehension Dataset
2019
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
RoBERTa: a Robustly Optimized BERT Pretraining Approach
BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension
DistilBERT, a Distilled Version of BERT: Smaller, Faster, Cheaper and Lighter
Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context
XLNet: Generalized Autoregressive Pretraining for Language Understanding
Adaptive Input Representations for Neural Language Modeling
Attention Interpretability Across NLP Tasks
Grad-CAM: Visual Explanations from Deep Networks Via Gradient-based Localization
Massively Multilingual Sentence Embeddings for Zero-Shot Cross-Lingual Transfer and Beyond
GLUE: a Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
Parameter-Efficient Transfer Learning for NLP
Cross-lingual Language Model Pretraining
MoverScore: Text Generation Evaluating with Contextualized Embeddings and Earth Mover Distance
Neural Oblivious Decision Ensembles for Deep Learning on Tabular Data
Latent Retrieval for Weakly Supervised Open Domain Question Answering
Multi-Stage Document Ranking with BERT
Modeling Task Relationships in Multi-task Learning with Multi-gate Mixture-of-Experts
Synthetic QA Corpora Generation with Roundtrip Consistency
Towards VQA Models That Can Read
2020
Language Models are Few-Shot Learners
Longformer: the Long-Document Transformer
Big Bird: Transformers for Longer Sequences
Beyond Accuracy: Behavioral Testing of NLP Models with CheckList
The Curious Case of Neural Text Degeneration
ELECTRA: Pre-training Text Encoders As Discriminators Rather Than Generators
TinyBERT: Distilling BERT for Natural Language Understanding
MPNet: Masked and Permuted Pre-training for Language Understanding
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
Scaling Laws for Neural Language Models
Unsupervised Cross-lingual Representation Learning at Scale
SpanBERT: Improving Pre-training by Representing and Predicting Spans
Intrinsic Dimensionality Explains the Effectiveness of Language Model Fine-Tuning
Approximate Nearest Neighbor Negative Contrastive Learning for Dense Text Retrieval
Document Ranking with a Pretrained Sequence-to-Sequence Model
ColBERT: Efficient and Effective Passage Search Via Contextualized Late Interaction Over BERT
REALM: Retrieval-Augmented Language Model Pre-Training
Linformer: Self-Attention with Linear Complexity
BLEURT: Learning Robust Metrics for Text Generation
Query-Key Normalization for Transformers
2021
Towards a Unified View of Parameter-Efficient Transfer Learning
BinaryBERT: Pushing the Limit of BERT Quantization
Towards Zero-Label Language Learning
Improving Language Models by Retrieving from Trillions of Tokens
WebGPT: Browser-assisted Question-answering with Human Feedback
The Power of Scale for Parameter-Efficient Prompt Tuning
Prefix-Tuning: Optimizing Continuous Prompts for Generation
LoRA: Low-Rank Adaptation of Large Language Models
Prompt Programming for Large Language Models: Beyond the Few-Shot Paradigm
Muppet: Massive Multi-task Representations with Pre-Finetuning
Synthesizer: Rethinking Self-Attention in Transformer Models
CodeT5: Identifier-aware Unified Pre-trained Encoder-Decoder Models for Code Understanding and Generation
Extracting Training Data from Large Language Models
Large Dual Encoders are Generalizable Retrievers
Text Generation by Learning from Demonstrations
Leveraging Passage Retrieval with Generative Models for Open Domain Question Answering
A General Language Assistant As a Laboratory for Alignment
2022
Formal Mathematics Statement Curriculum Learning
Survey of Hallucination in Natural Language Generation
Transformer Quality in Linear Time
Chain of Thought Prompting Elicits Reasoning in Large Language Models
PaLM: Scaling Language Modeling with Pathways
Fantastically Ordered Prompts and Where to Find Them: Overcoming Few-Shot Prompt Order Sensitivity
Beyond the Imitation Game: Quantifying and Extrapolating the Capabilities of Language Models
Training Compute-Optimal Large Language Models
Large Language Models Still Can’t Plan (A Benchmark for LLMs on Planning and Reasoning about Change)
OPT: Open Pre-trained Transformer Language Models
Diffusion-LM Improves Controllable Text Generation
DeepPERF: a Deep Learning-Based Approach for Improving Software Performance
No Language Left Behind: Scaling Human-Centered Machine Translation
Efficient Few-Shot Learning Without Prompts
Large Language Models are Different
Solving Quantitative Reasoning Problems with Language Models
AD-DROP: Attribution-Driven Dropout for Robust Language Model Fine-Tuning
Why Can GPT Learn In-Context? Language Models Secretly Perform Gradient Descent As Meta-Optimizers
Finetuned Language Models are Zero-shot Learners
Learning to Summarize from Human Feedback
Training Language Models to Follow Instructions with Human Feedback
Constitutional AI: Harmlessness from AI Feedback
RoFormer: Enhanced Transformer with Rotary Position Embedding
Train Short, Test Long: Attention with Linear Biases Enables Input Length Extrapolation
Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity
Locating and Editing Factual Associations in GPT
Scale Efficiently: Insights from Pre-training and Fine-tuning Transformers
Holistic Evaluation of Language Models
SummaC: Re-Visiting NLI-based Models for Inconsistency Detection in Summarization
InCoder: a Generative Model for Code Infilling and Synthesis
Large Language Models are Zero-Shot Reasoners
An Efficient Memory-Augmented Transformer for Knowledge-Intensive NLP Tasks
Unsupervised Dense Information Retrieval with Contrastive Learning
Implicit Relation Linking for Question Answering Over Knowledge Graph
Galactica: a Large Language Model for Science
MuRAG: Multimodal Retrieval-Augmented Generator
Distilling Knowledge from Reader to Retriever for Question Answering
Learn to Explain: Multimodal Reasoning Via Thought Chains for Science Question Answering
BitFit: Simple Parameter-efficient Fine-tuning for Transformer-based Masked Language-models
Recurrent Memory Transformer
2023
ReAct: Synergizing Reasoning and Acting in Language Models
LLaMA: Open and Efficient Foundation Language Models
Alpaca: a Strong, Replicable Instruction-Following Model
Transformer Models: an Introduction and Catalog
Learning to Compress Prompts with Gist Tokens
LLaMA-Adapter: Efficient Fine-tuning of Language Models with Zero-init Attention
LLaMA-Adapter V2: Parameter-Efficient Visual Instruction Model
LIMA: Less is More for Alignment
Language is Not All You Need: Aligning Perception with Language Models
QLoRA: Efficient Finetuning of Quantized LLMs
Direct Preference Optimization: Your Language Model is Secretly a Reward Model
Deduplicating Training Data Makes Language Models Better
Llama 2: Open Foundation and Fine-Tuned Chat Models
Retentive Network: a Successor to Transformer for Large Language Models
The Case for 4-bit Precision: K-bit Inference Scaling Laws
DeBERTa: Decoding-enhanced BERT with Disentangled Attention
UL2: Unifying Language Learning Paradigms
Graph of Thoughts: Solving Elaborate Problems with Large Language Models
Accelerating Large Language Model Decoding with Speculative Sampling
Pretraining Language Models with Human Preferences
Large Language Models As Optimizers
G-Eval: NLG Evaluation Using GPT-4 with Better Human Alignment
Chain-of-Verification Reduces Hallucination in Large Language Models
LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models
Mass-Editing Memory in a Transformer
MTEB: Massive Text Embedding Benchmark
Language Modeling is Compression
SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models
Zephyr: Direct Distillation of LM Alignment
Intuitions
Weights’s Alignment Handbook
Evaluating Large Language Models: a Comprehensive Survey
Tamil-LLaMA: a New Tamil Language Model Based on LLaMA 2
Think Before You Speak: Training Language Models with Pause Tokens
YaRN: Efficient Context Window Extension of Large Language Models
StarCoder: May the Source be with You!
Let’s Verify Step by Step
Scalable Extraction of Training Data from (Production) Language Models
Gemini: a Family of Highly Capable Multimodal Models
Llama Guard: LLM-based Input-Output Safeguard for Human-AI Conversations
Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models
Human-Centered Loss Functions (HALOs)
Ignore This Title and HackAPrompt: Exposing Systemic Vulnerabilities of LLMs Through a Global Scale Prompt Hacking Competition
Fine-Tuning or Retrieval? Comparing Knowledge Injection in LLMs
Tuning Language Models by Proxy
Group Preference Optimization: Few-shot Alignment of Large Language Models
Large Language Models are Neurosymbolic Reasoners
LM-Infinite: Simple On-The-Fly Length Generalization for Large Language Models
LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning
Large Language Models are Null-Shot Learners
Knowledge Fusion of Large Language Models
MentaLLaMA: Interpretable Mental Health Analysis on Social Media with Large Language Models
ChatQA: Building GPT-4 Level Conversational QA Models
Parameter-efficient Tuning for Large Language Model Without Calculating Its Gradients
Mathematical Discoveries from Program Search with Large Language Models
Gaussian Error Linear Units (GELUs)
BioGPT: Generative Pre-trained Transformer for Biomedical Text Generation and Mining
AutoGen: Enabling Next-Gen LLM Applications Via Multi-Agent Conversation
Towards Expert-Level Medical Question Answering with Large Language Models
Adaptation with Self-Evaluation to Improve Selective Prediction in LLMs
Least-to-Most Prompting Enables Complex Reasoning in Large Language Models
BitNet: Scaling 1-bit Transformers for Large Language Models
2024
Relying on the Unreliable: the Impact of Language Models’ Reluctance to Express Uncertainty
Matryoshka Representation Learning
Self-Refine: Iterative Refinement with Self-Feedback
The Claude 3 Model Family: Opus, Sonnet, Haiku
ORPO: Monolithic Preference Optimization Without Reference Model
Rainbow Teaming: Open-Ended Generation of Diverse Adversarial Prompts
Stealing Part of a Production Language Model
OneBit: Towards Extremely Low-bit Large Language Models
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
Multilingual E5 Text Embeddings: a Technical Report
MambaByte: Token-free Selective State Space Model
How Faithful are RAG Models? Quantifying the Tug-of-war Between RAG and LLMs’ Internal Prior
Adaptive-RAG: Learning to Adapt Retrieval-Augmented Large Language Models Through Question Complexity
Many-Shot In-Context Learning
Gemma 2: Improving Open Language Models at a Practical Size
The Llama 3 Herd of Models
MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases
Speech
2006
Connectionist Temporal Classification: Labelling Unsegmented Sequence Data with Recurrent Neural Networks
2010
Front-end Factor Analysis for Speaker Verification
2012
Sequence Transduction with Recurrent Neural Networks
2013
Hybrid Speech Recognition with Deep Bidirectional LSTM
2014
Towards End-To-End Speech Recognition with Recurrent Neural Networks
Deep Neural Networks for Small Footprint Text-dependent Speaker Verification
2015
Listen, Attend and Spell
2017
CNN Architectures for Large-Scale Audio Classification
2018
X-Vectors: Robust DNN Embeddings for Speaker Recognition
WaveGlow: a Flow-based Generative Network for Speech Synthesis
Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions
2019
Wav2vec: Unsupervised Pre-training for Speech Recognition
SpecAugment: a Simple Data Augmentation Method for Automatic Speech Recognition
Margin Matters: Towards More Discriminative Deep Neural Network Embeddings for Speaker Recognition
Fréchet Audio Distance: a Metric for Evaluating Music Enhancement Algorithms
2020
Conformer: Convolution-augmented Transformer for Speech Recognition
Wav2vec 2.0: a Framework for Self-Supervised Learning of Speech Representations
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
GAN-based Data Generation for Speech Emotion Recognition
Generalized End-to-end Loss for Speaker Verification
2021
Generative Spoken Language Modeling from Raw Audio
Text-Free Prosody-Aware Generative Spoken Language Modeling
Speech Resynthesis from Discrete Disentangled Self-Supervised Representations
WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing
Recent Advances in End-to-End Automatic Speech Recognition
W2v-BERT: Combining Contrastive Learning and Masked Language Modeling for Self-Supervised Speech Pre-Training
SUPERB: Speech Processing Universal PERformance Benchmark
2022
Direct Speech-to-speech Translation with Discrete Units
Textless Speech Emotion Conversion Using Discrete and Decomposed Representations
Generative Spoken Dialogue Language Modeling
Textless-lib: a Library for Textless Spoken Language Processing
Self-Supervised Speech Representation Learning: a Review
Masked Autoencoders That Listen
Robust Speech Recognition Via Large-Scale Weak Supervision
AudioGen: Textually Guided Audio Generation
AudioLM: a Language Modeling Approach to Audio Generation
SpeechT5: Unified-Modal Encoder-Decoder Pre-Training for Spoken Language Processing
2023
Scaling Speech Technology to 1,000+ Languages
Distil-Whisper: Robust Knowledge Distillation Via Large-Scale Pseudo Labelling
Matcha-TTS: a Fast TTS Architecture with Conditional Flow Matching
Audiobox: Unified Audio Generation with Natural Language Prompts
Multimodal
2015
CIDEr: Consensus-based Image Description Evaluation
2016
“Why Should I Trust You?” Explaining the Predictions of Any Classifier
SPICE: Semantic Propositional Image Caption Evaluation
2017
A Unified Approach to Interpreting Model Predictions
Mixup: Beyond Empirical Risk Minimization
Multimodal Machine Learning: a Survey and Taxonomy
2019
Representation Learning with Contrastive Predictive Coding
2020
Modality Dropout for Improved Performance-driven Talking Faces
Augmentation Adversarial Training for Self-supervised Speaker Recognition
BERTScore: Evaluating Text Generation with BERT
2021
Comparing Data Augmentation and Annotation Standardization to Improve End-to-end Spoken Language Understanding Models
Learning Transferable Visual Models from Natural Language Supervision
Zero-Shot Text-to-Image Generation
ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision
MLIM: Vision-and-language Model Pre-training with Masked Language and Image Modeling
MURAL: Multimodal, Multi-task Retrieval Across Languages
Perceiver: General Perception with Iterative Attention
Multimodal Few-Shot Learning with Frozen Language Models
On the Opportunities and Risks of Foundation Models
CLIPScore: a Reference-free Evaluation Metric for Image Captioning
VideoCLIP: Contrastive Pre-training for Zero-shot Video-Text Understanding
2022
DeepNet: Scaling Transformers to 1,000 Layers
Data2vec: a General Framework for Self-supervised Learning in Speech, Vision and Language
Hierarchical Text-Conditional Image Generation with CLIP Latents
AutoDistill: an End-to-End Framework to Explore and Distill Hardware-Efficient Language Models
A Generalist Agent
Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors
I-Code: an Integrative and Composable Multimodal Learning Framework
VL-BEIT: Generative Vision-Language Pretraining
FLAVA: a Foundational Language and Vision Alignment Model
Flamingo: a Visual Language Model for Few-Shot Learning
Stable and Latent Diffusion Model
DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation
UniT: Multimodal Multitask Learning with a Unified Transformer
Perceiver IO: a General Architecture for Structured Inputs & Outputs
Foundation Transformers
Efficient Self-supervised Learning with Contextualized Target Representations for Vision, Speech and Language
Imagic: Text-Based Real Image Editing with Diffusion Models
EDICT: Exact Diffusion Inversion Via Coupled Transformations
CLAP: Learning Audio Concepts from Natural Language Supervision
An Empirical Study of GPT-3 for Few-Shot Knowledge-Based VQA
OCR-free Document Understanding Transformer
PubTables-1M: Towards Comprehensive Table Extraction from Unstructured Documents
CoCa: Contrastive Captioners are Image-Text Foundation Models
BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
Grounded Language-Image Pre-training (GLIP)
LayoutLMv3: Pre-training for Document AI with Unified Text and Image Masking
2023
Pix2Video: Video Editing Using Image Diffusion
TaskMatrix.AI: Completing Tasks by Connecting Foundation Models with Millions of APIs
HuggingGPT: Solving AI Tasks with ChatGPT and Its Friends in HuggingFace
Large-scale Contrastive Language-Audio Pretraining with Feature Fusion and Keyword-to-Caption Augmentation
ImageBind: One Embedding Space to Bind Them All
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
InstructBLIP: Towards General-purpose Vision-Language Models with Instruction Tuning
AtMan: Understanding Transformer Predictions Through Memory Efficient Attention Manipulation
Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models
MM-REACT: Prompting ChatGPT for Multimodal Reasoning and Action
PaLM-E: an Embodied Multimodal Language Model
MIMIC-IT: Multi-Modal In-Context Instruction Tuning
Visual Instruction Tuning
Multimodal Chain-of-Thought Reasoning in Language Models
Dreamix: Video Diffusion Models are General Video Editors
Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection
OpenFlamingo: an Open-Source Framework for Training Large Autoregressive Vision-Language Models
Med-Flamingo: a Multimodal Medical Few-shot Learner
Towards Generalist Biomedical AI
PaLI: a Jointly-Scaled Multilingual Language-Image Model
Nougat: Neural Optical Understanding for Academic Documents
Text-Conditional Contextualized Avatars for Zero-Shot Personalization
Make-An-Animation: Large-Scale Text-conditional 3D Human Motion Generation
AnyMAL: an Efficient and Scalable Any-Modality Augmented Language Model
Phenaki: Variable Length Video Generation from Open Domain Textual Description
Text2Video-Zero: Text-to-Image Diffusion Models are Zero-Shot Video Generators
SeamlessM4T – Massively Multilingual & Multimodal Machine Translation
PaLI-X: on Scaling up a Multilingual Vision and Language Model
The Dawn of LMMs: Preliminary Explorations with GPT-4V(ision)
Sparks of Artificial General Intelligence: Early Experiments with GPT-4
MiniGPT-4: Enhancing Vision-Language Understanding with Advanced Large Language Models
MiniGPT-v2: Large Language Model As a Unified Interface for Vision-Language Multi-task Learning
SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis
Diffusion Model Alignment Using Direct Preference Optimization
Seamless: Multilingual Expressive and Streaming Speech Translation
VideoPoet: a Large Language Model for Zero-Shot Video Generation
LLaMA-VID: an Image is Worth 2 Tokens in Large Language Models
FERRET: Refer and Ground Anything Anywhere at Any Granularity
StarVector: Generating Scalable Vector Graphics Code from Images
KOSMOS-2: Grounding Multimodal Large Language Models to the World
Generative Multimodal Models are In-Context Learners
Alpha-CLIP: a CLIP Model Focusing on Wherever You Want
2024
MoE-LLaVA: Mixture of Experts for Large Vision-Language Models
Towards Language Models That Can See: Computer Vision Through the LENS of Natural Language
Cobra: Extending Mamba to Multi-Modal Large Language Model for Efficient Inference
Core ML
1991
What Every Computer Scientist Should Know about Floating-Point Arithmetic
1997
Bidirectional Recurrent Neural Networks
2001
Obtaining Calibrated Probability Estimates from Decision Trees and Naive Bayesian Classifiers
2002
Transforming Classifier Scores Into Accurate Multiclass Probability Estimates
Dimensionality Reduction by Learning an Invariant Mapping
2006
Reducing the Dimensionality of Data with Neural Networks
2007
What Every Programmer Should Know about Memory
2008
ADASYN: Adaptive Synthetic Sampling Approach for Imbalanced Learning
2009
Large-scale Deep Unsupervised Learning Using Graphics Processors
Practical Guide to Controlled Experiments on the Web: Listen to Your Customers Not to the HiPPO
Curriculum Learning
2011
SMOTE: Synthetic Minority Over-sampling Technique
2012
Acoustic Modeling Using Deep Belief Networks
Improving Neural Networks by Preventing Co-adaptation of Feature Detectors
Trustworthy Online Controlled Experiments: Five Puzzling Outcomes Explained
2014
Dropout: a Simple Way to Prevent Neural Networks from Overfitting
Intriguing Properties of Neural Networks
2015
ADAM: a Method for Stochastic Optimization
Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift
2016
XGBoost: a Scalable Tree Boosting System
Layer Normalization
Efficient and Robust Approximate Nearest Neighbor Search Using Hierarchical Navigable Small World Graphs
2017
Axiomatic Attribution for Deep Networks
Decoupled Weight Decay Regularization
On Calibration of Modern Neural Networks
Beta Calibration: a Well-founded and Easily Implemented Improvement on Logistic Calibration for Binary Classifiers
Understanding Black-box Predictions Via Influence Functions
Mixed Precision Training
StarSpace: Embed All the Things!
2018
Model Cards for Model Reporting
Interpretability Beyond Feature Attribution: Quantitative Testing with Concept Activation Vectors (TCAV)
Representer Point Selection for Explaining Deep Neural Networks
2019
Fast Transformer Decoding: One Write-Head is All You Need
Similarity of Neural Network Representations Revisited
Toward a Better Trade-off Between Performance and Fairness with Kernel-based Distribution Matching
Root Mean Square Layer Normalization
Generating Long Sequences with Sparse Transformers
Understanding and Improving Layer Normalization
2020
Estimating Training Data Influence by Tracing Gradient Descent
LEEP - Log Expected Empirical Prediction
OTDD - Optimal Transport Dataset Distance
Null It Out: Guarding Protected Attributes by Iterative Nullspace Projection
GLU Variants Improve Transformer
ZeRO: Memory Optimizations Toward Training Trillion Parameter Models
2021
Efficient Deep Learning: a Survey on Making Deep Learning Models Smaller, Faster, and Better
Do Wide and Deep Networks Learn the Same Things? Uncovering How Neural Network Representations Vary with Width and Depth
Using AntiPatterns to Avoid MLOps Mistakes
Self-attention Does Not Need O(n²) Memory
Sharpness-Aware Minimization for Efficiently Improving Generalization
ZeRO-Infinity: Breaking the GPU Memory Wall for Extreme Scale Deep Learning
Efficiently Modeling Long Sequences with Structured State Spaces
2022
Pathways: Asynchronous Distributed Dataflow for ML
PolyLoss: a Polynomial Expansion Perspective of Classification Loss Functions
Memorization Without Overfitting: Analyzing the Training Dynamics of Large Language Models
Federated Learning with Buffered Asynchronous Aggregation
Applied Federated Learning: Architectural Design for Robust and Efficient Learning in Privacy Aware Settings
Operationalizing Machine Learning: an Interview Study
A/B Testing Intuition Busters
Effect of Scale on Catastrophic Forgetting in Neural Networks
Fine-Tuning Can Distort Pretrained Features and Underperform Out-of-Distribution
FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness
Robust Fine-tuning of Zero-shot Models
Efficiently Scaling Transformer Inference
2023
FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning
Surgical Fine-Tuning Improves Adaptation to Distribution Shifts
Dataless Knowledge Fusion by Merging Weights of Language Models
Weak-to-Strong Generalization: Eliciting Strong Capabilities with Weak Supervision
Mamba: Linear-Time Sequence Modeling with Selective State Spaces
2024
Kolmogorov–Arnold Networks (KANs): an Alternative to Multi-Layer Perceptrons for Enhanced Interpretability and Accuracy
RecSys
2008
Calibrated Recommendations
2009
The Wisdom of the Few: a Collaborative Filtering Approach Based on Expert Opinions from the Web
2010
Factorization Machines
2011
Unbiased Offline Evaluation of Contextual-bandit-based News Article Recommendation Algorithms
2015
Collaborative Deep Learning for Recommender Systems
2016
Wide & Deep Learning for Recommender Systems
Deep Neural Networks for YouTube Recommendations
Product-based Neural Networks for User Response Prediction
2017
Neural Collaborative Filtering
Deep & Cross Network for Ad Click Predictions
DeepFM: a Factorization-Machine Based Neural Network for CTR Prediction
2018
Deep Interest Network for Click-Through Rate Prediction
2019
Behavior Sequence Transformer for E-commerce Recommendation in Alibaba
BERT4Rec: Sequential Recommendation with Bidirectional Encoder Representations from Transformer
2020
DCN V2: Improved Deep & Cross Network and Practical Lessons for Web-scale Learning to Rank Systems
Neural Collaborative Filtering vs. Matrix Factorization Revisited
2022
PinnerFormer: Sequence Modeling for User Representation at Pinterest
RL
2015
Trust Region Policy Optimization
2016
Mastering the Game of Go with Deep Neural Networks & Tree Search
2017
Proximal Policy Optimization Algorithms
Evolution Strategies As a Scalable Alternative to Reinforcement Learning
Playing FPS Games with Deep Reinforcement Learning
Mastering the Game of Go Without Human Knowledge
Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm
2019
AlphaStar: Grandmaster Level in StarCraft II Using Multi-agent Reinforcement Learning
2020
Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model
2021
Highly Accurate Protein Structure Prediction with AlphaFold
2023
Faster Sorting Algorithms Discovered Using Deep Reinforcement Learning
Graph ML
2000
Nonlinear Dimensionality Reduction by Locally Linear Embedding
A Global Geometric Framework for Nonlinear Dimensionality Reduction
2001
Laplacian Eigenmaps and Spectral Techniques for Embedding and Clustering
2014
DeepWalk: Online Learning of Social Representations
2016
Asymmetric Transitivity Preserving Graph Embedding
Structural Deep Network Embedding
Node2vec: Scalable Feature Learning for Networks
2017
Inductive Representation Learning on Large Graphs
Semi-Supervised Classification with Graph Convolutional Networks
2018
Graph Attention Networks
2019
Exploiting Edge Features for Graph Neural Networks
Selected Papers / Good-to-know
Computer Vision
2015
Learning Deep Features for Discriminative Localization
2016
Understanding the Effective Receptive Field in Deep Convolutional Neural Networks
2017
Quo Vadis, Action Recognition? a New Model and the Kinetics Dataset
Densely Connected Convolutional Networks
2018
Neural Discrete Representation Learning
2019
EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks
2020
Taming Transformers for High-Resolution Image Synthesis
Self-training with Noisy Student Improves ImageNet Classification
Big Transfer (BiT): General Visual Representation Learning
Multi-modal Dense Video Captioning
Efficient Saliency Maps for Explainable AI
2021
Finetuning Pretrained Transformers Into RNNs
VATT: Transformers for Multimodal Self-Supervised Learning from Raw Video, Audio and Text
Self-supervised Learning for Fast and Scalable Time Series Hyper-parameter Tuning
Accelerating SLIDE Deep Learning on Modern CPUs: Vectorization, Quantizations, Memory Optimizations, and More
Emerging Properties in Self-Supervised Vision Transformers
Semi-Supervised Learning of Visual Features by Non-Parametrically Predicting View Assignments with Support Samples
Enhancing Photorealism Enhancement
FNet: Mixing Tokens with Fourier Transforms
Are Convolutional Neural Networks or Transformers More Like Human Vision?
RegNet: Self-Regulated Network for Image Classification
Lossy Compression for Lossless Prediction
2022
YOLOv6: a Single-Stage Object Detection Framework for Industrial Applications
2023
Your Diffusion Model is Secretly a Zero-Shot Classifier
DINOv2: Learning Robust Visual Features Without Supervision
Consistency Models
Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold
ZipIt! Merging Models from Different Tasks Without Training
Self-Consuming Generative Models Go MAD
Substance or Style: What Does Your Image Embedding Know?
Scaling Vision Transformers to 22 Billion Parameters
CLIP-Dissect: Automatic Description of Neuron Representations in Deep Vision Networks
On the Impact of Knowledge Distillation for Model Interpretability
Replacing Softmax with ReLU in Vision Transformers
Learning Vision from Models Rivals Learning Vision from Data
TUTEL: Adaptive Mixture-of-Experts at Scale
2024
Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large Datasets
Scalable Diffusion Models with State Space Backbone
Towards Evaluating the Robustness of Visual State Space Models
NLP
2008
ROUGE-C: a Fully Automated Evaluation Method for Multi-document Summarization
2018
Generating Wikipedia by Summarizing Long Sequences
2019
Sentence-BERT: Sentence Embeddings Using Siamese BERT-Networks
Diversity and Depth in Per-Example Routing Models
Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism
2020
Efficient Transformers: a Survey
Towards a Human-like Open-Domain Chatbot
MobileBERT: a Compact Task-Agnostic BERT for Resource-Limited Devices
Compressing BERT: Studying the Effects of Weight Pruning on Transfer Learning
Movement Pruning: Adaptive Sparsity by Fine-Tuning
Dense Passage Retrieval for Open-domain Question Answering
Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks
Unsupervised Commonsense Question Answering with Self-Talk
Transformers are RNNs: Fast Autoregressive Transformers with Linear Attention
2021
Pretrained Transformers As Universal Computation Engines
SimCSE: Simple Contrastive Learning of Sentence Embeddings
DeCLUTR: Deep Contrastive Learning for Unsupervised Textual Representations
Transformer Feed-Forward Layers are Key-Value Memories
Measuring Massive Multitask Language Understanding
2022
A Causal Lens for Controllable Text Generation
SNCSE: Contrastive Learning for Unsupervised Sentence Embedding with Soft Negative Samples
LaMDA: Language Models for Dialog Applications
Causal Inference Principles for Reasoning about Commonsense Causality
RescoreBERT: Discriminative Speech Recognition Rescoring with BERT
Using DeepSpeed and Megatron to Train Megatron-Turing NLG 530B, a Large-Scale Generative Language Model
Extreme Compression for Pre-trained Transformers Made Simple and Efficient
Memorizing Transformers
Ask Me Anything: a Simple Strategy for Prompting Language Models
Large Language Models Can Self-Improve
∞-former: Infinite Memory Transformer
Multitask Prompted Training Enables Zero-Shot Task Generalization
Large Language Models Encode Clinical Knowledge
GLaM: Efficient Scaling of Language Models with Mixture-of-Experts
Automatic Chain of Thought Prompting in Large Language Models
Less is More: Parameter-Free Text Classification with Gzip
A Length-Extrapolatable Transformer
Efficient Training of Language Models to Fill in the Middle
Language Models of Code are Few-Shot Commonsense Learners
A Systematic Investigation of Commonsense Knowledge in Large Language Models
MERLOT Reserve: Neural Script Knowledge Through Vision and Language and Sound
MegaBlocks: Efficient Sparse Training with Mixture-of-Experts
STaR: Self-Taught Reasoner: Bootstrapping Reasoning with Reasoning
2023
Challenges and Applications of Large Language Models
LLM-Adapters: an Adapter Family for Parameter-Efficient Fine-Tuning of Large Language Models
GPT Detectors are Biased Against Non-native English Writers
GPT4All: Training an Assistant-style Chatbot with Large Scale Data Distillation from GPT-3.5-Turbo
SELF-INSTRUCT: Aligning Language Model with Self Generated Instructions
Efficient Methods for Natural Language Processing: a Survey
Better Language Models of Code Through Self-Improvement
Distilling Step-by-Step! Outperforming Larger Language Models with Less Training Data and Smaller Model Sizes
Active Retrieval Augmented Generation
FrugalGPT: How to Use Large Language Models While Reducing Cost and Improving Performance
Exploring In-Context Learning Capabilities of Foundation Models for Generating Knowledge Graphs from Text
How Language Model Hallucinations Can Snowball
Unlimiformer: Long-Range Transformers with Unlimited Length Input
Gorilla: Large Language Model Connected with Massive APIs
SPRING: GPT-4 Out-performs RL Algorithms by Studying Papers and Reasoning
Deliberate Then Generate: Enhanced Prompting Framework for Text Generation
Enabling Large Language Models to Generate Text with Citations
The RefinedWeb Dataset for Falcon LLM: Outperforming Curated Corpora with Web Data, and Web Data Only
Fine-Tuning Language Models with Just Forward Passes
Self-Consistency Improves Chain of Thought Reasoning in Language Models
Tree of Thoughts: Deliberate Problem Solving with Large Language Models
Beyond Chain-of-Thought, Effective Graph-of-Thought Reasoning in Large Language Models
LLM-QAT: Data-Free Quantization Aware Training for Large Language Models
RWKV: Reinventing RNNs for the Transformer Era
Knowledge Distillation of Large Language Models
Unifying Large Language Models and Knowledge Graphs: a Roadmap
Orca: Progressive Learning from Complex Explanation Traces of GPT-4
Textbooks are All You Need
Extending Context Window of Large Language Models Via Positional Interpolation
Deep Language Networks: Joint Prompt Training of Stacked LLMs Using Variational Inference
A Simple and Effective Pruning Approach for Large Language Models
To Repeat or Not to Repeat: Insights from Scaling LLM Under Token-Crisis
ART: Automatic Multi-step Reasoning and Tool-use for Large Language Models
Lost in the Middle: How Language Models Use Long Contexts
Improving Retrieval-Augmented Large Language Models Via Data Importance Learning
Scaling Transformer to 1M Tokens and Beyond with RMT
Hyena Hierarchy: Towards Larger Convolutional Language Models
LongNet: Scaling Transformers to 1,000,000,000 Tokens
The Curse of Recursion: Training on Generated Data Makes Models Forget
Artificial Artificial Artificial Intelligence: Crowd Workers Widely Use Large Language Models for Text Production Tasks
FLASK: Fine-grained Language Model Evaluation Based on Alignment Skill Sets
Secrets of RLHF in Large Language Models Part I: PPO
WizardLM: Empowering Large Language Models to Follow Complex Instructions
Universal and Transferable Adversarial Attacks on Aligned Language Models
Scaling TransNormer to 175 Billion Parameters
What Learning Algorithm is In-context Learning? Investigations with Linear Models
What In-Context Learning “Learns” In-Context: Disentangling Task Recognition and Task Learning
PanGu-Coder2: Boosting Large Language Models for Code with Ranking Feedback
Multimodal Neurons in Pretrained Text-Only Transformers
ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs
Skeleton-of-Thought: Large Language Models Can Do Parallel Decoding
The Hydra Effect: Emergent Self-repair in Language Model Computations
MetaGPT: Meta Programming for Multi-Agent Collaborative Framework
XSTest: a Test Suite for Identifying Exaggerated Safety Behaviours in Large Language Models
Tool Documentation Enables Zero-Shot Tool-Usage with Large Language Models
Jina Embeddings: a Novel Set of High-Performance Sentence Embedding Models
SelfCheck: Using LLMs to Zero-Shot Check Their Own Step-by-Step Reasoning
Stack More Layers Differently: High-Rank Training Through Low-Rank Updates
AlpaGasus: Training a Better Alpaca with Fewer Data
How is ChatGPT’s Behavior Changing Over Time?
Do Multilingual Language Models Think Better in English?
Skill-it! a Data-Driven Skills Framework for Understanding and Training Language Models
In-context Autoencoder for Context Compression in a Large Language Model
No Train No Gain: Revisiting Efficient Training Algorithms for Transformer-based Language Models
Leveraging Implicit Feedback from Deployment Data in Dialogue
FacTool: Factuality Detection in Generative AI – a Tool Augmented Framework for Multi-Task and Multi-Domain Scenarios
The Belebele Benchmark: a Parallel Reading Comprehension Dataset in 122 Language Variants
Large Language Models Can be Easily Distracted by Irrelevant Context
Fast Inference from Transformers Via Speculative Decoding
Textbooks are All You Need II
Cognitive Mirage: a Review of Hallucinations in Large Language Models
Structured Chain-of-Thought Prompting for Code Generation
DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models
QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models
Editing Commonsense Knowledge in GPT
The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models Via Chain-of-Thought Fine-Tuning
FinGPT: Open-Source Financial Large Language Models
The Reversal Curse: LLMs Trained on “A is B” Fail to Learn “B is A”
Less is More: Task-aware Layer-wise Distillation for Language Model Compression
Reinforced Self-Training (ReST) for Language Modeling
How Do Large Language Models Capture the Ever-changing World Knowledge? a Review of Recent Advances
Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning
A Reparameterized Discrete Diffusion Model for Text Generation
AR-Diffusion: Auto-Regressive Diffusion Model for Text Generation
HomoDistil: Homotopic Task-Agnostic Distillation of Pre-trained Transformers
Likelihood-Based Diffusion Language Models
Who’s Harry Potter? Approximate Unlearning in LLMs
Mistral 7B
Take a Step Back: Evoking Reasoning Via Abstraction in Large Language Models
Text Generation with Diffusion Language Models: a Pre-training Approach with Continuous Paragraph Denoise
JudgeLM: Fine-tuned Large Language Models are Scalable Judges
LLMs As Factual Reasoners: Insights from Existing Benchmarks and Beyond
Llemma: an Open Language Model for Mathematics
CODEFUSION: a Pre-trained Diffusion Model for Code Generation
CodeT5+: Open Code Large Language Models for Code Understanding and Generation
Augmenting Language Models with Long-Term Memory
ALCUNA: Large Language Models Meet New Knowledge
The Perils & Promises of Fact-checking with Large Language Models
SLoRA: Federated Parameter Efficient Fine-Tuning of Language Models
S-LoRA: Serving Thousands of Concurrent LoRA Adapters
ChainPoll: a High Efficacy Method for LLM Hallucination Detection
Mixture-of-Experts Meets Instruction Tuning: a Winning Combination for Large Language Models
LLM-Blender: Ensembling Large Language Models with Pairwise Ranking and Generative Fusion
Semantic Uncertainty: Linguistic Invariances for Uncertainty Estimation in Natural Language Generation
FACTSCORE: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation
Fine-tuning Language Models for Factuality
Better Zero-Shot Reasoning with Self-Adaptive Prompting
Universal Self-Adaptive Prompting for Zero-shot and Few-shot Learning
Chain-of-Note: Enhancing Robustness in Retrieval-Augmented Language Models
Thread of Thought: Unraveling Chaotic Contexts
Large Language Models Understand and Can be Enhanced by Emotional Stimuli
Text Embeddings Reveal (Almost) As Much As Text
Influence Scores at Scale for Efficient Language Data Sampling
TableLlama: Towards Open Large Generalist Models for Tables
NEFTune: Noisy Embeddings Improve Instruction Finetuning
Break the Sequential Dependency of LLM Inference Using Lookahead Decoding
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
Online Speculative Decoding
PaSS: Parallel Speculative Sampling
System 2 Attention (is Something You Might Need Too)
Aligning Large Language Models Through Synthetic Feedback
Contrastive Chain-of-Thought Prompting
ChipNeMo: Domain-Adapted LLMs for Chip Design
Efficient Streaming Language Models with Attention Sinks
Precise Zero-Shot Dense Retrieval Without Relevance Labels
Tied-LoRA: Enhancing Parameter Efficiency of LoRA with Weight Tying
Adaptive Budget Allocation for Parameter-Efficient Fine-Tuning
Chain-of-Knowledge: Grounding Large Language Models Via Dynamic Knowledge Adapting Over Heterogeneous Sources
Exponentially Faster Language Modeling
Prompt Injection Attack Against LLM-integrated Applications
Jailbroken: How Does LLM Safety Training Fail?
Orca 2: Teaching Small Language Models How to Reason
Rephrase and Respond: Let Large Language Models Ask Better Questions for Themselves
Pythia: a Suite for Analyzing Large Language Models Across Training and Scaling
Starling-7B: Increasing LLM Helpfulness & Harmlessness with RLAIF
Large Language Models are Human-Level Prompt Engineers
A Survey of Graph Meets Large Language Model: Progress and Future Directions
Nash Learning from Human Feedback
Retrieval-augmented Multi-modal Chain-of-Thoughts Reasoning for Large Language Models
Magicoder: Source Code is All You Need
TarGEN: Targeted Data Generation with Large Language Models
RLAIF: Scaling Reinforcement Learning from Human Feedback with AI Feedback
The Alignment Ceiling: Objective Mismatch in Reinforcement Learning from Human Feedback
Revisiting Large Language Models As Zero-shot Relation Extractors
NexusRaven-V2: Surpassing GPT-4 for Zero-shot Function Calling
Instruction-Following Evaluation for Large Language Models
Just Ask for Calibration: Strategies for Eliciting Calibrated Confidence Scores from Language Models Fine-Tuned with Human Feedback
A Comprehensive Survey on Vector Database: Storage and Retrieval Technique, Challenges
FLEEK: Factual Error Detection and Correction with Evidence Retrieved from External Knowledge
Automatic Hallucination Assessment for Aligned Large Language Models Via Transferable Adversarial Attacks
OLaLa: Ontology Matching with Large Language Models
LLM-Pruner: on the Structural Pruning of Large Language Models
SparseGPT: Massive Language Models Can be Accurately Pruned in One-Shot
RAGAS: Automated Evaluation of Retrieval Augmented Generation
EVER: Mitigating Hallucination in Large Language Models Through Real-Time Verification and Rectification
Prometheus: Inducing Fine-Grained Evaluation Capability in Language Models
AlphaCode 2
MediTron-70B: Scaling Medical Pretraining for Large Language Models
Can Generalist Foundation Models Outcompete Special-Purpose Tuning? Case Study in Medicine
Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks
X-InstructBLIP: a Framework for Aligning X-Modal Instruction-Aware Representations to LLMs and Emergent Cross-modal Reasoning
SwiftSage: a Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks
Show Your Work: Scratchpads for Intermediate Computation with Language Models
Program of Thoughts Prompting: Disentangling Computation from Reasoning for Numerical Reasoning Tasks
Chain of Code: Reasoning with a Language Model-Augmented Code Emulator
When Do Generative Query and Document Expansions Fail? a Comprehensive Study Across Methods Retrievers and Datasets
MemGPT: Towards LLMs As Operating Systems
The Internal State of an LLM Knows When It’s Lying
GPT4All: an Ecosystem of Open Source Compressed Language Models
The Falcon Series of Open Language Models
Promptbase: Elevating the Power of Foundation Models Through Advanced Prompt Engineering
Phi-2: the Surprising Power of Small Language Models
QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models
PromptBench: a Unified Library for Evaluation of Large Language Models
Making LLMs Worth Every Penny: Resource-Limited Text Classification in Banking
Mathematical Language Models: a Survey
A Survey of Large Language Models in Medicine: Principles, Applications, and Challenges
Language Model Inversion
LLM360: Towards Fully Transparent Open-Source LLMs
LLMLingua: Compressing Prompts for Accelerated Inference of Large Language Models
Retrieval-Augmented Generation for Large Language Models: a Survey
LLM in a Flash: Efficient Large Language Model Inference with Limited Memory
ReST Meets ReAct: Self-Improvement for Multi-Step Reasoning LLM Agent
PowerInfer: Fast Large Language Model Serving with a Consumer-grade GPU
Adversarial Attacks on GPT-4 Via Simple Random Search
An In-depth Look at Gemini’s Language Abilities
LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment
Large Language Models are Better Reasoners with Self-Verification
PaperMage: a Unified Toolkit for Processing, Representing, and Manipulating Visually-Rich Scientific Documents
Large Language Models on Graphs: a Comprehensive Survey
An LLM Compiler for Parallel Function Calling
Scaling Down, LiTting Up: Efficient Zero-Shot Listwise Reranking with Seq2seq Encoder–Decoder Models
Is ChatGPT Good at Search? Investigating Large Language Models As Re-Ranking Agents
NPHardEval: Dynamic Benchmark on Reasoning Ability of Large Language Models Via Complexity Classes
Robust Knowledge Extraction from Large Language Models Using Social Choice Theory
LongQLoRA: Efficient and Effective Method to Extend Context Length of Large Language Models
Editing Models with Task Arithmetic
Time is Encoded in the Weights of Finetuned Language Models
TinyGPT-V: Efficient Multimodal Large Language Model Via Small Backbones
OpenChat: Advancing Open-Source Language Models with Mixed-Quality Data
What Makes Good Data for Alignment? a Comprehensive Study of Automatic Data Selection in Instruction Tuning
Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena
MultiInstruct: Improving Multi-Modal Zero-Shot Learning Via Instruction Tuning
Cheap and Quick: Efficient Vision-Language Instruction Tuning for Large Language Models
Dense X Retrieval: What Retrieval Granularity Should We Use?
ARES: an Automated Evaluation Framework for Retrieval-Augmented Generation Systems
A Comprehensive Survey of Hallucination Mitigation Techniques in Large Language Models
A Survey of Reasoning with Foundation Models
GPT-4V(ision) is a Generalist Web Agent, If Grounded
Large Language Models for Generative Information Extraction: a Survey
EQ-Bench: an Emotional Intelligence Benchmark for Large Language Models
Turbulence: Systematically and Automatically Testing Instruction-Tuned Large Language Models for Code
TrustLLM: Trustworthiness in Large Language Models
Blending is All You Need: Cheaper Better Alternative to Trillion-Parameters LLM
Enhancing Zero-Shot Chain-of-Thought Reasoning in Large Language Models Through Logic
Airavata: Introducing Hindi Instruction-Tuned LLM
Chain-of-Symbol Prompting for Spatial Relationships in Large Language Models
Continual Pre-training of Language Models
Simplifying Transformer Blocks
Promptbreeder: Self-Referential Self-Improvement Via Prompt Evolution
AnglE-Optimized Text Embeddings
SLiC-HF: Sequence Likelihood Calibration with Human Feedback
Ghostbuster: Detecting Text Ghostwritten by Large Language Models
Monarch Mixer: a Simple Sub-Quadratic GEMM-Based Architecture
DistillCSE: Distilled Contrastive Learning for Sentence Embeddings
The Unlocking Spell on Base LLMs: Rethinking Alignment Via In-Context Learning
GLiNER: Generalist Model for Named Entity Recognition Using Bidirectional Transformer
2024
BLIVA: a Simple Multimodal LLM for Better Handling of Text-Rich Visual Questions
Building a Llama2-finetuned LLM for Odia Language Utilizing Domain Knowledge Instruction Set
Leveraging Large Language Models for NLG Evaluation: a Survey
Nomic Embed: Training a Reproducible Long Context Text Embedder
Jina Embeddings 2: 8192-Token General-Purpose Text Embeddings for Long Documents
Seven Failure Points When Engineering a Retrieval Augmented Generation System
SALAD-Bench: a Hierarchical and Comprehensive Safety Benchmark for Large Language Models
DoRA: Weight-Decomposed Low-Rank Adaptation
ICDPO: Effectively Borrowing Alignment Capability of Others Via In-context Direct Preference Optimization
A Human-Inspired Reading Agent with Gist Memory of Very Long Contexts
SliceGPT: Compress Large Language Models by Deleting Rows and Columns
Two-dimensional Matryoshka Sentence Embeddings
Benchmarking Hallucination in Large Language Models Based on Unanswerable Math Word Problem
IndicVoices: Towards Building an Inclusive Multilingual Speech Dataset for Indian Languages
ArtPrompt: ASCII Art-based Jailbreak Attacks Against Aligned LLMs
The Calibration Gap Between Model and Human Confidence in Large Language Models
Fact-Checking the Output of Large Language Models Via Token-Level Uncertainty Quantification
RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval
The Power of Noise: Redefining Retrieval for RAG Systems
DeepSeek LLM: Scaling Open-Source Language Models with Longtermism
Contrastive Preference Optimization: Pushing the Boundaries of LLM Performance in Machine Translation
MultiHop-RAG: Benchmarking Retrieval-Augmented Generation for Multi-Hop Queries
Human Alignment of Large Language Models Through Online Preference Optimisation
A General Theoretical Paradigm to Understand Learning from Human Preferences
What are Tools Anyway? a Survey from the Language Model Perspective
AutoDev: Automated AI-Driven Development
LLM4Decompile: Decompiling Binary Code with Large Language Models
OLMo: Accelerating the Science of Language Models
Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research
MathVista: Evaluating Mathematical Reasoning of Foundation Models in Visual Contexts
RAG Vs Fine-tuning: Pipelines, Tradeoffs, and a Case Study on Agriculture
RAFT: Adapting Language Model to Domain Specific RAG
Corrective Retrieval Augmented Generation
SaulLM-7B: a Pioneering Large Language Model for Law
LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement
FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions
AIOS: LLM Agent Operating System
Lumos: a Modular Open-Source LLM-Based Agent Framework
A Comparison of Human, GPT-3.5, and GPT-4 Performance in a University-Level Coding Course
sDPO: Don’t Use Your Data All at Once
RS-DPO: a Hybrid Rejection Sampling and Direct Preference Optimization Method for Alignment of Large Language Models
Dataverse: Open-Source ETL (Extract Transform Load) Pipeline for Large Language Models
Teaching Large Language Models to Reason with Reinforcement Learning
Jamba: a Hybrid Transformer-Mamba Language Model
Fine Tuning vs. Retrieval Augmented Generation for Less Popular Knowledge
LLM2Vec: Large Language Models are Secretly Powerful Text Encoders
HGOT: Hierarchical Graph of Thoughts for Retrieval-Augmented In-Context Learning in Factuality Evaluation
ReFT: Representation Finetuning for Language Models
Towards Conversational Diagnostic AI
Reka Core, Flash, and Edge: a Series of Powerful Multimodal Language Models
Phi-3 Technical Report: a Highly Capable Language Model Locally on Your Phone
Mixtral of Experts
BioMistral: a Collection of Open-Source Pretrained Large Language Models for Medical Domains
Gemma: Open Models Based on Gemini Research and Technology
SPAFIT: Stratified Progressive Adaptation Fine-tuning for Pre-trained Large Language Models
Instruction-tuned Language Models are Better Knowledge Learners
Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models
Prometheus 2: an Open Source Language Model Specialized in Evaluating Other Language Models
Mixture of LoRA Experts
Teaching Large Language Models to Self-Debug
You Only Cache Once: Decoder-Decoder Architectures for Language Models
JetMoE: Reaching Llama2 Performance with 0.1M Dollars
LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding
Better & Faster Large Language Models Via Multi-token Prediction
When Benchmarks are Targets: Revealing the Sensitivity of Large Language Model Leaderboards
Visualization-of-Thought Elicits Spatial Reasoning in Large Language Models
Hallucination of Multimodal Large Language Models: a Survey
In-Context Learning with Long-Context Models: an In-Depth Exploration
NOLA: Compressing LoRA Using Linear Combination of Random Basis
Data Selection for Transfer Unlearning
A Primer on the Inner Workings of Transformer-Based Language Models
MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning
Nemotron-4 340B Technical Report
RewardBench: Evaluating Reward Models for Language Modeling
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
Transferring Knowledge from Large Foundation Models to Small Downstream Models
Alice in Wonderland: Simple Tasks Showing Complete Reasoning Breakdown in State-Of-the-Art Large Language Models
In Search of Needles in a 11M Haystack: Recurrent Memory Finds What LLMs Miss
MAGPIE: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing
MDPO: Conditional Preference Optimization for Multimodal Large Language Models
Aligning Large Multimodal Models with Factually Augmented RLHF
Statistical Rejection Sampling Improves Preference Optimization
Chameleon: Mixed-Modal Early-Fusion Foundation Models
MMMU: a Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI
MMBench: is Your Multi-modal Model an All-around Player?
GPQA: a Graduate-Level Google-Proof Q&A Benchmark
Sycophancy to Subterfuge: Investigating Reward Tampering in Language Models
ReST-MCTS*: LLM Self-Training Via Process Reward Guided Tree Search
FLAME: Factuality-Aware Alignment for Large Language Models
Is DPO Superior to PPO for LLM Alignment? a Comprehensive Study
Improving Multi-step Reasoning for LLMs with Deliberative Planning
SAMBA: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling
RichRAG: Crafting Rich Responses for Multi-faceted Queries in Retrieval-Augmented Generation
A Careful Examination of Large Language Model Performance on Grade School Arithmetic
Pairwise Proximal Policy Optimization: Harnessing Relative Feedback for LLM Alignment
BPO: Supercharging Online Preference Learning by Adhering to the Proximity of Behavior LLM
Instruction Pre-Training: Language Models are Supervised Multitask Learners
Improve Mathematical Reasoning in Language Models by Automated Process Supervision
Accessing GPT-4 Level Mathematical Olympiad Solutions Via Monte Carlo Tree Self-refine with LLaMa-3 8B: a Technical Report
SimPO: Simple Preference Optimization with a Reference-Free Reward
Discovering Preference Optimization Algorithms with and for Large Language Models
ToRA: a Tool-Integrated Reasoning Agent for Mathematical Problem Solving
Scaling LLM Test-Time Compute Optimally Can be More Effective Than Scaling Model Parameters
Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking
Mixture-of-Depths: Dynamically Allocating Compute in Transformer-Based Language Models
Large Language Monkeys: Scaling Inference Compute with Repeated Sampling
Learn Beyond the Answer: Training Language Models with Reflection for Mathematical Reasoning
Agent Q: Advanced Reasoning and Learning for Autonomous AI Agents
V-STaR: Training Verifiers for Self-Taught Reasoners
Speech
2017
On Evaluating and Comparing Conversational Agents
2018
Attention-Based Models for Text-Dependent Speaker Verification
Efficient Voice Trigger Detection for Low Resource Hardware
2020
Automatic Speaker Recognition with Limited Data
Speaker Identification for Household Scenarios with Self-attention and Adversarial Training
Stacked 1D Convolutional Networks for End-to-end Small Footprint Voice Trigger Detection
Optimize What Matters: Training DNN-HMM Keyword Spotting Model Using End Metric
MatchboxNet: 1D Time-Channel Separable Convolutional Neural Network Architecture for Speech Commands Recognition
HiFi-GAN: High-Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks
2021
Streaming Transformer for Hardware Efficient Voice Trigger Detection and False Trigger Mitigation
Joint ASR and Language Identification Using RNN-T: an Efficient Approach to Dynamic Language Switching
HuBERT: Self-Supervised Speech Representation Learning by Masked Prediction of Hidden Units
Deep Spoken Keyword Spotting: an Overview
BW-EDA-EEND: Streaming End-to-end Neural Speaker Diarization for a Variable Number of Speakers
Attentive Contextual Carryover for Multi-turn End-to-end Spoken Language Understanding
SmallER: Scaling Neural Entity Resolution for Edge Devices
Leveraging Multilingual Neural Language Models for On-Device Natural Language Understanding
Comparing Data Augmentation and Annotation Standardization to Improve End-to-end Spoken Language Understanding Models
CLAR: Contrastive Learning of Auditory Representations
2022
Robust Self-Supervised Audio-Visual Speech Recognition
Adaptive Global-Local Context Fusion for Multi-Turn Spoken Language Understanding
SpeechMatrix: a Large-Scale Mined Corpus of Multilingual Speech-to-Speech Translations
GIT: a Generative Image-to-text Transformer for Vision and Language
2023
Simple and Controllable Music Generation
UnitY: Two-pass Direct Speech-to-speech Translation with Discrete Units
Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong General Audio Event Taggers
Joint Audio and Speech Understanding
Long-Form Music Generation with Latent Diffusion
Multimodal
2021
Style Equalization: Unsupervised Learning of Controllable Generative Sequence Models
Rethinking Attention with Performers
2022
Learning Audio-Visual Speech Representation by Masked Multimodal Cluster Prediction
Image As a Foreign Language: BEIT Pretraining for All Vision and Vision-Language Tasks
Visual Programming: Compositional Visual Reasoning Without Training
Video-ChatGPT: Towards Detailed Video Understanding Via Large Vision and Language Models
2023
Meta-Transformer: a Unified Framework for Multimodal Learning
Any-to-Any Generation Via Composable Diffusion
Bytes are All You Need: Transformers Operating Directly on File Bytes
Prompting Large Language Models with Answer Heuristics for Knowledge-based Visual Question Answering
Sound Reconstruction from Human Brain Activity Via a Generative Model with Brain-like Auditory Features
LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day
Unified Model for Image, Video, Audio and Language Tasks
Qwen-7B: Open Foundation and Human-aligned Models
Qwen-VL: a Frontier Large Vision-Language Model with Versatile Abilities
NExT-GPT: Any-to-Any Multimodal LLM
LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents
Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning
Demystifying CLIP Data
Scalable Diffusion Models with Transformers
DeepFloyd IF
PIXART-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
RAPHAEL: Text-to-Image Generation Via Large Mixture of Diffusion Paths
ERNIE-ViLG 2.0: Improving Text-to-Image Diffusion Model with Knowledge-Enhanced Mixture-of-Denoising-Experts
CogVLM: Visual Expert for Pretrained Language Models
Improved Baselines with Visual Instruction Tuning
Matryoshka Diffusion Models
MAViL: Masked Audio-Video Learners
TextDiffuser-2: Unleashing the Power of Language Models for Text Rendering
CoDi-2: In-Context Interleaved and Interactive Any-to-Any Generation
ZipLoRA: Any Subject in Any Style by Effectively Merging LoRAs
Hyperbolic Image-Text Representations
Evaluating Object Hallucination in Large Vision-Language Models
Analyzing and Mitigating Object Hallucination in Large Vision-Language Models
FLAP: Fast Language-Audio Pre-training
Jointly Learning Visual and Auditory Speech Representations from Raw Data
MIRASOL3B: a Multimodal Autoregressive Model for Time-Aligned and Contextual Modalities
Video-LLaMA: an Instruction-tuned Audio-Visual Language Model for Video Understanding
VideoCoCa: Video-Text Modeling with Zero-Shot Transfer from Contrastive Captioners
Visual Instruction Inversion: Image Editing Via Visual Prompting
A Video is Worth 4096 Tokens: Verbalize Videos to Understand Them in Zero Shot
Emu Edit: Precise Image Editing Via Recognition and Generation Tasks
2024
CoVLM: Composing Visual Entities and Relationships in Large Language Models Via Communicative Decoding
MME: a Comprehensive Evaluation Benchmark for Multimodal Large Language Models
Vision-Flan: Scaling Human-Labeled Tasks in Visual Instruction Tuning
PALO: a Polyglot Large Multimodal Model for 5B People
Sigmoid Loss for Language Image Pre-Training
EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model Under Weak Conditions
DeepSeek-VL: Towards Real-World Vision-Language Understanding
mPLUG-DocOwl 1.5: Unified Structure Learning for OCR-free Document Understanding
OOTDiffusion: Outfitting Fusion Based Latent Diffusion for Controllable Virtual Try-on
Glyph-ByT5: a Customized Text Encoder for Accurate Visual Text Rendering
Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models
Mora: Enabling Generalist Video Generation Via a Multi-Agent Framework
VILA: on Pre-training for Visual Language Models
PaliGemma: a Versatile 3B VLM for Transfer
Core ML
2016
The Peaking Phenomenon in Semi-supervised Learning
2018
Model Evaluation, Model Selection, and Algorithm Selection in Machine Learning
2019
Which Algorithmic Choices Matter at Which Batch Sizes? Insights from a Noisy Quadratic Model
2020
Triple Wins: Boosting Accuracy, Robustness and Efficiency Together by Enabling Input-Adaptive Inference
2021
Tensor Programs V: Tuning Large Neural Networks Via Zero-Shot Hyperparameter Transfer
2022
OmniXAI: a Library for Explainable AI
VeLO: Training Versatile Learned Optimizers by Scaling up
2023
CoLT5: Faster Long-Range Transformers with Conditional Computation
GQA: Training Generalized Multi-Query Transformer Models from Multi-Head Checkpoints
Sophia: a Scalable Stochastic Second-order Optimizer for Language Model Pre-training
DoReMi: Optimizing Data Mixtures Speeds up Language Model Pretraining
One-for-All: Generalized LoRA for Parameter-Efficient Fine-tuning
Tackling the Curse of Dimensionality with Physics-Informed Neural Networks
(Ab)using Images and Sounds for Indirect Instruction Injection in Multi-Modal LLMs
Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback
DiffusionBERT: Improving Generative Masked Language Models with Diffusion Models
The Depth-to-Width Interplay in Self-Attention
The No Free Lunch Theorem, Kolmogorov Complexity, and the Role of Inductive Biases in Machine Learning
2024
Evolutionary Optimization of Model Merging Recipes
BANG: Billion-Scale Approximate Nearest Neighbor Search Using a Single GPU
RecSys
2019
Deep Learning Recommendation Model for Personalization and Recommendation Systems
FiBiNET: Combining Feature Importance and Bilinear Feature Interaction for Click-Through Rate Prediction
AutoInt: Automatic Feature Interaction Learning Via Self-Attentive Neural Networks
2020
DCN V2: Improved Deep & Cross Network and Practical Lessons for Web-scale Learning to Rank Systems
GCN-Based User Representation Learning for Unifying Robust Recommendation and Fraudster Detection
2022
DHEN: a Deep and Hierarchical Ensemble Network for Large-Scale Click-Through Rate Prediction
2023
Towards Deeper, Lighter, and Interpretable Cross Network for CTR Prediction
Do LLMs Understand User Preferences? Evaluating LLMs on User Rating Prediction
Fresh Content Needs More Attention: Multi-funnel Fresh Content Recommendation
Large Language Models are Zero-Shot Rankers for Recommender Systems
How Can Recommender Systems Benefit from Large Language Models: a Survey
RL
2022
TransDreamer: Reinforcement Learning with Transformer World Models
Graph ML
2019
RotatE: Knowledge Graph Embedding by Relational Rotation in Complex Space
2023
Graph-Bert: Only Attention is Needed for Learning Graph Representations
2024
GraphMaker: Can Diffusion Models Generate Large Attributed Graphs?
Generative Diffusion Models on Graphs: Methods and Applications
A Survey on Graph Diffusion Models: Generative AI in Science for Molecule Protein and Material