博客

LLM

投机采样（Speculative Sampling）

LLM

LLM模型参数量计算

LLM

LLM

谷歌DiLoCo: Distributed Low-Communication Training of Language Models

KV Cache（键值缓存）

Transformer LLM

ReLLa: Retrieval-enhanced Large Language Models for Lifelong Sequential Behavior Comprehension in Recommendation

论文 LLM 推荐系统

QLoRA（Quantized Low-Rank Adapter）

LLM

腾讯混元大模型

Thinking Claude prompt工具

LLM

Grok-3：xAI团队研发的第三代大语言模型

DeepSeek目前发布的模型版本（持续更新）

PPO（Proximal Policy Optimization）近端策略优化

LLM

LLM | LLM入门

LLM

Reinforcement Learning from Human Feedback（RLHF）

LLM

BBPE（Byte-Level Byte Pair Encoding）字节级字节对编码

LLM

XLNet：一种基于Transformer架构的自回归语言模型

LLM

«
1
2
3
4
5
6
»