LLM Core Papers | LLM核心论文

Sentiment Neuron: &nbsp; Learning to Generate Reviews and Discovering Sentiment
GPT-1: &nbsp;<a href="https://link.zhihu.com/?target=https%3A//s3-us-west-2.amazonaws.com/openai-assets/research-covers/language-unsupervised/language_understanding_paper.pdf">Improving Language Understanding by Generative Pre-Training
Scaling Law: &nbsp; Scaling Laws for Neural Language Models
GPT-3: &nbsp; Language Models are Few-Shot Learners

2023-11-22 (Updated: 2023-11-22)

Large Language Models (LLMs) are cool, yet the sheer volume of research, with hundreds of new papers daily, can be overwhelming. Few papers, however, truly endure. The selection below is curated through extensive discussions with my friends, and have shaped my foundational perspective in this field. I hope this paper list can offer some help to those new to the field. And let me know how you think about this paper collection!

LLM虽然是灌水天堂，每天都有百来篇论文上arxiv，但大浪淘沙，真正有信息量、而又通用深刻的工作并不多。以下这些论文是和朋友多次讨论后留下的核心工作，他们很大程度上构成了我思考NLP问题的框架。