Toggle navigation
博客
首页
推荐
标签
轻览
日历
搜索
排序方式:
按更新时间排序
|
按标题排序
|
按浏览次数排序
Efficient Streaming Language Models with Attention Sinks
论文
LLM
Asynchronous Stochastic Gradient Descent with Delay Compensation
论文
优化器
论文:Perceiver - General Perception with Iterative Attention
论文
Transformer
Google
Deepmind
AdaF2M2 : Comprehensive Learning and Responsive Leveraging Features in Recommendation System
论文
字节
CLS, COMPOSITE SLICE TRANSFORMER: AN EFFICIENT TRANSFORMER WITH COMPOSITION OF MULTI-SCALE MULTI-RANGE ATTENTIONS
论文
TRANSFORMER
Funnel-Transformer: Filtering out Sequential Redundancy for Efficient Language Processing
论文
TIGER:Recommender Systems with Generative Retrieval 生成式召回
论文
推荐系统
召回
Soft MoE《FROM SPARSE TO SOFT MIXTURES OF EXPERTS》
论文
Fast Inference from Transformers via Speculative Decoding
论文
谷歌MLP - Mixer: An all - MLP Architecture for Vision
论文
谷歌
谷歌DiLoCo: Distributed Low-Communication Training of Language Models
论文
LLM
快手冷启动POSO: Personalized Cold Start Modules for Large-scale Recommender Systems
论文
冷启动
快手
快手MARM: Unlocking the Future of Recommendation Systems through Memory Augmentation and Scalable Complexity
论文
快手
论文 | ROFORMER: ENHANCED TRANSFORMER WITH ROTARY POSITION EMBEDDING、RoPE编码
论文
RoPE
论文:Improving Item Cold - start Recommendation via Model - agnostic Conditional Variational Autoencoder
论文
冷启动
Rethinking the Role of Pre-ranking in Large-scale E-Commerce Searching System
论文
粗排
ReLLa: Retrieval-enhanced Large Language Models for Lifelong Sequential Behavior Comprehension in Recommendation
论文
LLM
推荐系统
阿里MIMN模型Practice on Long Sequential User Behavior Modeling for Click-Through Rate Prediction
论文
阿里
论文《Surge Phenomenon in Optimal Learning Rate and Batch Size Scaling》学习率和批量大小
论文
Adaptive Domain Scaling for Personalized Sequential Modeling in Recommenders,ADS自适应域缩放
论文
推荐系统
跨域推荐
字节
«
1
2
3
4
5
6
7
»