ONE·PIECE
首页
大模型
LLM技术栈
SKILLs
Harness工程
系统设计
机器学习
Android
LeetCode
AI头条
论文推荐
分类
标签
归档
知识图谱
友链
关于
充电驿站
小书屋
大影单
搜索
文章
287
标签
105
分类
21
首页
大模型
LLM技术栈
SKILLs
Harness工程
系统设计
机器学习
Android
LeetCode
AI头条
论文推荐
分类
标签
归档
知识图谱
友链
关于
充电驿站
小书屋
大影单
The Most Epic
Mountains
in the
World
KNOW MORE ↓
The Recommended Of
PAPERS
All
LLM
Multimodal
Diffusion
Reinforcement Learning
Deep Learning
Machine Learning
NLP
CV
过滤器相应归一化层:在深度神经网络的训练中消除批次依赖性
Deep Learning
A Multigrid Method for Efficiently Training Video Models
Computer Vision and Pattern Recognition
KernelNet:用于深度生成建模的数据依赖的内核参数化
Machine Learning
AP-Perf:在可微学习中整合通用的性能指标
Deep Learning
高效的卷积神经网络用于基于深度的多姿态估计
Computer Vision and Pattern Recognition
Conclusion-Supplement Answer Generation for Non-Factoid Questions
Natural Language Processing
多域图像分割的对抗性归一化
Machine Learning
通过可赋予的反例实现深度神经网络指纹识别
Deep Learning
Attention Is All You Need — Transformer 奠基之作
Large Language Model
BERT:Deep Bidirectional Transformers for Language Understanding
Large Language Model
GPT-3:Language Models are Few-Shot Learners
Large Language Model
T5:Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
Large Language Model
InstructGPT:Training Language Models to Follow Instructions with Human Feedback
Large Language Model
Constitutional AI:Harmlessness from AI Feedback(Claude 对齐论文)
Large Language Model
Direct Preference Optimization (DPO):Your Language Model is Secretly a Reward Model
Large Language Model
LLaMA:Open and Efficient Foundation Language Models
Large Language Model
LLaMA 2:Open Foundation and Fine-Tuned Chat Models
Large Language Model
Mistral 7B — 高效开源小模型新标杆
Large Language Model
Mixtral of Experts — 稀疏混合专家架构实践
Large Language Model
LoRA:Low-Rank Adaptation of Large Language Models
Large Language Model
QLoRA:Efficient Finetuning of Quantized LLMs
Large Language Model
Prefix-Tuning:Optimizing Continuous Prompts for Generation
Large Language Model
RAG:Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks
Large Language Model
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Large Language Model
ReAct:Synergizing Reasoning and Acting in Language Models
Large Language Model
Tree of Thoughts:Deliberate Problem Solving with Large Language Models
Large Language Model
Mamba:Linear-Time Sequence Modeling with Selective State Spaces
Large Language Model
GPT-4 Technical Report
Large Language Model
DeepSeek-R1:Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
Large Language Model
Phi-3 Technical Report:A Highly Capable Language Model Locally on Your Phone
Large Language Model
CLIP:Learning Transferable Visual Models From Natural Language Supervision
Multimodal
DALL-E:Zero-Shot Text-to-Image Generation
Multimodal
Flamingo:A Visual Language Model for Few-Shot Learning
Multimodal
LLaVA:Visual Instruction Tuning
Multimodal
ImageBind:One Embedding Space To Bind Them All
Multimodal
Gemini:A Family of Highly Capable Multimodal Models
Multimodal
DDPM:Denoising Diffusion Probabilistic Models
Diffusion Models
LDM / Stable Diffusion:High-Resolution Image Synthesis with Latent Diffusion Models
Diffusion Models
DALL-E 2:Hierarchical Text-Conditional Image Generation with CLIP Diffusion
Diffusion Models
DiT:Scalable Diffusion Models with Transformers
Diffusion Models
Score-Based Generative Modeling Through Stochastic Differential Equations
Diffusion Models
Sora:Video Generation as World Simulators
Diffusion Models
PPO:Proximal Policy Optimization Algorithms
Reinforcement Learning
DQN:Human-Level Control through Deep Reinforcement Learning
Reinforcement Learning
SAC:Soft Actor-Critic — Off-Policy Maximum Entropy Deep Reinforcement Learning
Reinforcement Learning
AlphaFold2:Highly Accurate Protein Structure Prediction with AlphaFold
Reinforcement Learning
GRPO:Group Relative Policy Optimization(DeepSeek-R1 训练方法)
Reinforcement Learning
DreamerV3:Mastering Diverse Domains through World Models
Reinforcement Learning
简
Algolia