ONE·PIECE
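Home
Large Models
  • LLM Tech Stack
  • SKILLs
  • Harness Engineering
System Design
Machine Learning
Android
LeetCode
AI Headlines
Recommended Papers
Categories
Tags
Archives
Knowledge Graph
Friend Links
About
Recharge Station
  • Little Library
  • Film List
Search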
Articles: 287
Tags: 105
Categories: 21


The Most Epic
Mountains in the World

The Recommended Of

PAPERS

  • All
  • LLM
  • Multimodal
  • Diffusion
  • Reinforcement Learning
  • Deep Learning
  • Machine Learning
  • NLP
  • CV
Filter Response Normalization Layer: Eliminating Batch Dependence in the Training of Deep Neural Networks
Deep Learning
A Multigrid Method for Efficiently Training Video Models
Computer Vision and Pattern Recognition
KernelNet: A Data-Dependent Kernel Parameterization for Deep Generative Modeling
Machine Learning
AP-Perf: Incorporating Generic Performance Metrics in Differentiable Learning
Deep Learning
Efficient Convolutional Neural Networks for Depth-Based Multi-Person Pose Estimation
Computer Vision and Pattern Recognition
Conclusion-Supplement Answer Generation for Non-Factoid Questions
Natural Language Processing
Adversarial Normalization for Multi-Domain Image Segmentation
Machine Learning
Deep Neural Network Fingerprinting by Conferrable Adversarial Examples
Deep Learning
Attention Is All You Need (the foundational Transformer paper)
Large Language Model
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Large Language Model
GPT-3: Language Models are Few-Shot Learners
Large Language Model
T5: Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
Large Language Model
InstructGPT: Training Language Models to Follow Instructions with Human Feedback
Large Language Model
Constitutional AI: Harmlessness from AI Feedback (Claude alignment paper)
Large Language Model
Direct Preference Optimization (DPO): Your Language Model is Secretly a Reward Model
Large Language Model
LLaMA: Open and Efficient Foundation Language Models
Large Language Model
LLaMA 2: Open Foundation and Fine-Tuned Chat Models
Large Language Model
Mistral 7B (a new benchmark for efficient open-source small models)
Large Language Model
Mixtral of Experts (a sparse mixture-of-experts architecture in practice)
Large Language Model
LoRA: Low-Rank Adaptation of Large Language Models
Large Language Model
QLoRA: Efficient Finetuning of Quantized LLMs
Large Language Model
Prefix-Tuning: Optimizing Continuous Prompts for Generation
Large Language Model
RAG: Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks
Large Language Model
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Large Language Model
ReAct: Synergizing Reasoning and Acting in Language Models
Large Language Model
Tree of Thoughts: Deliberate Problem Solving with Large Language Models
Large Language Model
Mamba: Linear-Time Sequence Modeling with Selective State Spaces
Large Language Model
GPT-4 Technical Report
Large Language Model
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
Large Language Model
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
Large Language Model
CLIP: Learning Transferable Visual Models From Natural Language Supervision
Multimodal
DALL-E: Zero-Shot Text-to-Image Generation
Multimodal
Flamingo: A Visual Language Model for Few-Shot Learning
Multimodal
LLaVA: Visual Instruction Tuning
Multimodal
ImageBind: One Embedding Space To Bind Them All
Multimodal
Gemini: A Family of Highly Capable Multimodal Models
Multimodal
DDPM: Denoising Diffusion Probabilistic Models
Diffusion Models
LDM / Stable Diffusion: High-Resolution Image Synthesis with Latent Diffusion Models
Diffusion Models
DALL-E 2: Hierarchical Text-Conditional Image Generation with CLIP Latents
Diffusion Models
DiT: Scalable Diffusion Models with Transformers
Diffusion Models
Score-Based Generative Modeling Through Stochastic Differential Equations
Diffusion Models
Sora: Video Generation Models as World Simulators
Diffusion Models
PPO: Proximal Policy Optimization Algorithms
Reinforcement Learning
DQN: Human-Level Control through Deep Reinforcement Learning
Reinforcement Learning
SAC (Soft Actor-Critic): Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Reinforcement Learning
AlphaFold2: Highly Accurate Protein Structure Prediction with AlphaFold
Reinforcement Learning
GRPO: Group Relative Policy Optimization (DeepSeek-R1 training method)
Reinforcement Learning
DreamerV3: Mastering Diverse Domains through World Models
Reinforcement Learning
©2016 - 2019 By Leo·Cheung
Some of life, you have to go to the great challenges. - By Kobe Bryant
浙ICP备19024714号