Toggle navigation
博客
首页
推荐
标签
轻览
日历
搜索
排序方式:
按更新时间排序
|
按标题排序
|
按浏览次数排序
TIGER:Recommender Systems with Generative Retrieval 生成式召回
论文
推荐系统
召回
Soft MoE《FROM SPARSE TO SOFT MIXTURES OF EXPERTS》
论文
Fast Inference from Transformers via Speculative Decoding
论文
谷歌MLP - Mixer: An all - MLP Architecture for Vision
论文
谷歌
谷歌DiLoCo: Distributed Low-Communication Training of Language Models
论文
LLM
快手冷启动POSO: Personalized Cold Start Modules for Large-scale Recommender Systems
论文
冷启动
快手
快手MARM: Unlocking the Future of Recommendation Systems through Memory Augmentation and Scalable Complexity
论文
快手
论文 | ROFORMER: ENHANCED TRANSFORMER WITH ROTARY POSITION EMBEDDING、RoPE编码
论文
RoPE
论文:Improving Item Cold - start Recommendation via Model - agnostic Conditional Variational Autoencoder
论文
冷启动
Rethinking the Role of Pre-ranking in Large-scale E-Commerce Searching System
论文
粗排
ReLLa: Retrieval-enhanced Large Language Models for Lifelong Sequential Behavior Comprehension in Recommendation
论文
LLM
推荐系统
阿里MIMN模型Practice on Long Sequential User Behavior Modeling for Click-Through Rate Prediction
论文
阿里
论文《Surge Phenomenon in Optimal Learning Rate and Batch Size Scaling》学习率和批量大小
论文
Adaptive Domain Scaling for Personalized Sequential Modeling in Recommenders,ADS自适应域缩放
论文
推荐系统
跨域推荐
字节
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
论文
deepseek
TPM基于树的渐进回归模型(Tree based Progressive Regression Model for Watch-Time Prediction in Short-video Recommendation)
论文
回归问题
论文:Large Memory Layers with Product Keys
论文
论文:Key-Value Memory Networks for Directly Reading Documents 键值记忆网络(KV-MemNN)
论文
论文:The Lazy Neuron Phenomenon: On Emergence of Activation Sparsity in Transformers
论文
Transformer
论文:Training Compute-Optimal Large Language Models 最优模型缩放结论
论文
«
1
2
3
4
5
6
7
»