The Recommended Of

PAPERS

过滤器相应归一化层：在深度神经网络的训练中消除批次依赖性

过滤器相应归一化层：在深度神经网络的训练中消除批次依赖性

Deep Learning

A Multigrid Method for Efficiently Training Video Models

A Multigrid Method for Efficiently Training Video Models

Computer Vision and Pattern Recognition

KernelNet：用于深度生成建模的数据依赖的内核参数化

KernelNet：用于深度生成建模的数据依赖的内核参数化

Machine Learning

AP-Perf：在可微学习中整合通用的性能指标

AP-Perf：在可微学习中整合通用的性能指标

Deep Learning

高效的卷积神经网络用于基于深度的多姿态估计

高效的卷积神经网络用于基于深度的多姿态估计

Computer Vision and Pattern Recognition

Conclusion-Supplement Answer Generation for Non-Factoid Questions

Conclusion-Supplement Answer Generation for Non-Factoid Questions

Natural Language Processing

多域图像分割的对抗性归一化

多域图像分割的对抗性归一化

Machine Learning

通过可赋予的反例实现深度神经网络指纹识别

通过可赋予的反例实现深度神经网络指纹识别

Deep Learning

Attention Is All You Need — Transformer 奠基之作

Attention Is All You Need — Transformer 奠基之作

Large Language Model

BERT：Deep Bidirectional Transformers for Language Understanding

BERT：Deep Bidirectional Transformers for Language Understanding

Large Language Model

GPT-3：Language Models are Few-Shot Learners

GPT-3：Language Models are Few-Shot Learners

Large Language Model

T5：Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

T5：Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

Large Language Model

InstructGPT：Training Language Models to Follow Instructions with Human Feedback

InstructGPT：Training Language Models to Follow Instructions with Human Feedback

Large Language Model

Constitutional AI：Harmlessness from AI Feedback（Claude 对齐论文）

Constitutional AI：Harmlessness from AI Feedback（Claude 对齐论文）

Large Language Model

Direct Preference Optimization (DPO)：Your Language Model is Secretly a Reward Model

Direct Preference Optimization (DPO)：Your Language Model is Secretly a Reward Model

Large Language Model

LLaMA：Open and Efficient Foundation Language Models

LLaMA：Open and Efficient Foundation Language Models

Large Language Model

LLaMA 2：Open Foundation and Fine-Tuned Chat Models

LLaMA 2：Open Foundation and Fine-Tuned Chat Models

Large Language Model

Mistral 7B — 高效开源小模型新标杆

Mistral 7B — 高效开源小模型新标杆

Large Language Model

Mixtral of Experts — 稀疏混合专家架构实践

Mixtral of Experts — 稀疏混合专家架构实践

Large Language Model

LoRA：Low-Rank Adaptation of Large Language Models

LoRA：Low-Rank Adaptation of Large Language Models

Large Language Model

QLoRA：Efficient Finetuning of Quantized LLMs

QLoRA：Efficient Finetuning of Quantized LLMs

Large Language Model

Prefix-Tuning：Optimizing Continuous Prompts for Generation

Prefix-Tuning：Optimizing Continuous Prompts for Generation

Large Language Model

RAG：Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks

RAG：Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks

Large Language Model

Chain-of-Thought Prompting Elicits Reasoning in Large Language Models

Chain-of-Thought Prompting Elicits Reasoning in Large Language Models

Large Language Model

ReAct：Synergizing Reasoning and Acting in Language Models

ReAct：Synergizing Reasoning and Acting in Language Models

Large Language Model

Tree of Thoughts：Deliberate Problem Solving with Large Language Models

Tree of Thoughts：Deliberate Problem Solving with Large Language Models

Large Language Model

Mamba：Linear-Time Sequence Modeling with Selective State Spaces

Mamba：Linear-Time Sequence Modeling with Selective State Spaces

Large Language Model

GPT-4 Technical Report

GPT-4 Technical Report

Large Language Model

DeepSeek-R1：Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

DeepSeek-R1：Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Large Language Model

Phi-3 Technical Report：A Highly Capable Language Model Locally on Your Phone

Phi-3 Technical Report：A Highly Capable Language Model Locally on Your Phone

Large Language Model

CLIP：Learning Transferable Visual Models From Natural Language Supervision

CLIP：Learning Transferable Visual Models From Natural Language Supervision

Multimodal

DALL-E：Zero-Shot Text-to-Image Generation

DALL-E：Zero-Shot Text-to-Image Generation

Multimodal

Flamingo：A Visual Language Model for Few-Shot Learning

Flamingo：A Visual Language Model for Few-Shot Learning

Multimodal

LLaVA：Visual Instruction Tuning

LLaVA：Visual Instruction Tuning

Multimodal

ImageBind：One Embedding Space To Bind Them All

ImageBind：One Embedding Space To Bind Them All

Multimodal

Gemini：A Family of Highly Capable Multimodal Models

Gemini：A Family of Highly Capable Multimodal Models

Multimodal

DDPM：Denoising Diffusion Probabilistic Models

DDPM：Denoising Diffusion Probabilistic Models

Diffusion Models

LDM / Stable Diffusion：High-Resolution Image Synthesis with Latent Diffusion Models

LDM / Stable Diffusion：High-Resolution Image Synthesis with Latent Diffusion Models

Diffusion Models

DALL-E 2：Hierarchical Text-Conditional Image Generation with CLIP Diffusion

DALL-E 2：Hierarchical Text-Conditional Image Generation with CLIP Diffusion

Diffusion Models

DiT：Scalable Diffusion Models with Transformers

DiT：Scalable Diffusion Models with Transformers

Diffusion Models

Score-Based Generative Modeling Through Stochastic Differential Equations

Score-Based Generative Modeling Through Stochastic Differential Equations

Diffusion Models

Sora：Video Generation as World Simulators

Sora：Video Generation as World Simulators

Diffusion Models

PPO：Proximal Policy Optimization Algorithms

PPO：Proximal Policy Optimization Algorithms

Reinforcement Learning

DQN：Human-Level Control through Deep Reinforcement Learning

DQN：Human-Level Control through Deep Reinforcement Learning

Reinforcement Learning

SAC：Soft Actor-Critic — Off-Policy Maximum Entropy Deep Reinforcement Learning

SAC：Soft Actor-Critic — Off-Policy Maximum Entropy Deep Reinforcement Learning

Reinforcement Learning

AlphaFold2：Highly Accurate Protein Structure Prediction with AlphaFold

AlphaFold2：Highly Accurate Protein Structure Prediction with AlphaFold

Reinforcement Learning

GRPO：Group Relative Policy Optimization（DeepSeek-R1 训练方法）

GRPO：Group Relative Policy Optimization（DeepSeek-R1 训练方法）

Reinforcement Learning

DreamerV3：Mastering Diverse Domains through World Models

DreamerV3：Mastering Diverse Domains through World Models

Reinforcement Learning

Algolia