The Evolutionary Trajectory of the Large Language Model
ChatGPTLLMLLMAILLMChatGPTLLMAILLMEtienne BernardNuMindCEOLLMNLPEtienneWolfram ResearchOneFlowOneFlowhttps://www.numind

ChatGPTLLMLLMAILLM
ChatGPTLLMAILLM
Etienne BernardNuMindCEOLLMNLPEtienneWolfram Research
OneFlowOneFlow
https://www.numind.ai/blog/what-are-large-language-models
| Etienne Bernard
OneFlow
|
1
TransformerLLM
2
2

n-gramnn=2n-gramn-gramthn-gramn-gram hith + hi = thii n-gramn-gram
n=4
complaine building thing Lakers inter blous of try sure camp Fican chips always and to New Semested and the to have being severy undiscussion to can you better is early shoot on
nntokenn<6
3
knob

the cat sat on the

RNNLSTMRNNmental staten-gramRNN
RNNNeural Conversational ModelGoogle2015LSTMLLMLLM310
720
LSTM
2017TransformerTransformerTransformer

Transformerhttps://arxiv.org/abs/1706.03762
TransformerGPUTransformerLLM
4
TransformerLLM2018AI

LLMhttps://github.com/Mooler0410/LLMsPracticalGuide
encoder-onlydecoder-only-encoder-decoder
RNNELMoBERTRoBERTaTransformer1GB10GB100GB0.1NLP
OpenAITransformer2018GPT-112019GPT-21540GBGPT-2GPT-2

https://cdn.openai.com/better-language-models/language_models_are_unsupervised_multitask_learners.pdf
RNNGPT-2OpenAIGPT-2LLMprompt
2020OpenAIGPT-3GPT-31750700GBGPT-3LLM
GPT-3GPT-3GPT-3HTML

GPT-3LLM
GPT-3
GPT-3GPT-22018LLM

1000GPT-4110001000100LLM100LLM
read

1000100
LLMGPT-1GPT-33GPT-3
GPT-410

LLM2022NLP
5
LLM
GPT-3LLM
LLM-instruction-answer pairsLLMLLMLLMRLHFOpenAIInstructGPTChatGPT

InstructGPTChatGPThttps://openai.com/blog/chatgpthttps://arxiv.org/abs/2203.02155
LLMLLM
LLMLLMOpenAI202212ChatGPTGPT-3.5InstructGPT-
ChatGPTLLMOpenAIGPT-4GPT-3.5ChatGPTAnthropicClaudeGoogleBardMetaLLaMALLMNuMind
ChatGPTChatGPTChatGPTLLMLLMAI
LLMLLMLLM
6
LLMLLM
>1GPU
150%
LLMGPT-4LLM
RNNRWKV
LLM/LLM
LLMAIAILLMRLHF
LLMLLMWolfram AlphaLLMAPIAPI

LLMhttps://arxiv.org/abs/2302.04761
ChatGPT LangChain Toolformer LLM
LLMLLM chain-of-thoughts promptingLLMLLM

https://arxiv.org/abs/2201.11903
LLMLLMAutoGPT BabyAGILLM
LLM--LLM
https://arxiv.org/abs/2210.11610LLM
LLMLLMLLMLLMLLM
Star OneFlow
https://github.com/Oneflow-Inc/oneflow/
Disclaimer: The content of this article is sourced from the internet. The copyright of the text, images, and other materials belongs to the original author. The platform reprints the materials for the purpose of conveying more information. The content of the article is for reference and learning only, and should not be used for commercial purposes. If it infringes on your legitimate rights and interests, please contact us promptly and we will handle it as soon as possible! We respect copyright and are committed to protecting it. Thank you for sharing.(Email:[email protected])