The Evolutionary Trajectory of the Large Language Model

ChatGPTLLMLLMAILLMChatGPTLLMAILLMEtienne BernardNuMindCEOLLMNLPEtienneWolfram ResearchOneFlowOneFlowhttps://www.numind

ChatGPTLLMLLMAILLM


ChatGPTLLMAILLM


Etienne BernardNuMindCEOLLMNLPEtienneWolfram Research


OneFlowOneFlow
https://www.numind.ai/blog/what-are-large-language-models

| Etienne Bernard

OneFlow

|


1

TransformerLLM

2

2

n-gramnn=2n-gramn-gramthn-gramn-gram hith + hi = thii n-gramn-gram

n=4

complaine building thing Lakers inter blous of try sure camp Fican chips always and to New Semested and the to have being severy undiscussion to can you better is early shoot on

nntokenn<6

3

knob

the cat sat on the

RNNLSTMRNNmental staten-gramRNN

RNNNeural Conversational ModelGoogle2015LSTMLLMLLM310

720

LSTM

2017TransformerTransformerTransformer

Transformerhttps://arxiv.org/abs/1706.03762

TransformerGPUTransformerLLM

4

TransformerLLM2018AI

LLMhttps://github.com/Mooler0410/LLMsPracticalGuide

encoder-onlydecoder-only-encoder-decoder

RNNELMoBERTRoBERTaTransformer1GB10GB100GB0.1NLP

OpenAITransformer2018GPT-112019GPT-21540GBGPT-2GPT-2

https://cdn.openai.com/better-language-models/language_models_are_unsupervised_multitask_learners.pdf

RNNGPT-2OpenAIGPT-2LLMprompt

2020OpenAIGPT-3GPT-31750700GBGPT-3LLM

GPT-3GPT-3GPT-3HTML

GPT-3LLM

GPT-3

GPT-3GPT-22018LLM

1000GPT-4110001000100LLM100LLM

read

1000100

LLMGPT-1GPT-33GPT-3

GPT-410

LLM2022NLP

5

LLM

GPT-3LLM

LLM-instruction-answer pairsLLMLLMLLMRLHFOpenAIInstructGPTChatGPT

InstructGPTChatGPThttps://openai.com/blog/chatgpthttps://arxiv.org/abs/2203.02155

LLMLLM

LLMLLMOpenAI202212ChatGPTGPT-3.5InstructGPT-

ChatGPTLLMOpenAIGPT-4GPT-3.5ChatGPTAnthropicClaudeGoogleBardMetaLLaMALLMNuMind

ChatGPTChatGPTChatGPTLLMLLMAI

LLMLLMLLM

6

LLMLLM

>1GPU

150%

LLMGPT-4LLM

RNNRWKV

LLM/LLM

LLMAIAILLMRLHF

LLMLLMWolfram AlphaLLMAPIAPI

LLMhttps://arxiv.org/abs/2302.04761

ChatGPT LangChain Toolformer LLM

LLMLLM chain-of-thoughts promptingLLMLLM

https://arxiv.org/abs/2201.11903

LLMLLMAutoGPT BabyAGILLM

LLM--LLM
https://arxiv.org/abs/2210.11610
LLM

LLMLLMLLMLLMLLM

Star OneFlow

https://github.com/Oneflow-Inc/oneflow/


Disclaimer: The content of this article is sourced from the internet. The copyright of the text, images, and other materials belongs to the original author. The platform reprints the materials for the purpose of conveying more information. The content of the article is for reference and learning only, and should not be used for commercial purposes. If it infringes on your legitimate rights and interests, please contact us promptly and we will handle it as soon as possible! We respect copyright and are committed to protecting it. Thank you for sharing.(Email:[email protected])