SOTA Models
I. Image Classification SOTA Models (15)
1. Model: AlexNet
Paper: ImageNet Classification with Deep Convolutional Neural Networks
2. Model: VGG
Paper: Very Deep Convolutional Networks for Large-Scale Image Recognition
3. Model: GoogLeNet
Paper: Going Deeper with Convolutions
4. Model: ResNet
Paper: Deep Residual Learning for Image Recognition
5. Model: ResNeXt
Paper: Aggregated Residual Transformations for Deep Neural Networks
6. Model: DenseNet
Paper: Densely Connected Convolutional Networks
7. Model: MobileNet
Paper: MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications
8. Model: SENet
Paper: Squeeze-and-Excitation Networks
9. Model: DPN
Paper: Dual Path Networks
10. Model: IGC V1
Paper: Interleaved Group Convolutions for Deep Neural Networks
11. Model: Residual Attention Network
Paper: Residual Attention Network for Image Classification
12. Model: ShuffleNet
Paper: ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices
13. Model: MnasNet
Paper: MnasNet: Platform-Aware Neural Architecture Search for Mobile
14. Model: EfficientNet
Paper: EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks
15. Model: NFNet
Paper: High-Performance Large-Scale Image Recognition Without Normalization
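ResNet and most architectures after it in this list build on the residual (shortcut) connection y = F(x) + x. A minimal NumPy sketch of the idea, with plain weight matrices standing in for the convolutions of a basic block:

```python
import numpy as np

def relu(x):
    return np.maximum(x, 0.0)

def residual_block(x, w1, w2):
    """Minimal residual block: y = ReLU(F(x) + x), where F is two
    weight layers (stand-ins for a ResNet block's convolutions)."""
    out = relu(x @ w1)    # first layer + activation
    out = out @ w2        # second layer, no activation yet
    return relu(out + x)  # identity shortcut, then activation

# With all-zero weights the block degenerates to ReLU(x): the shortcut
# lets a block fall back to (near-)identity, which is what makes very
# deep networks trainable.
x = np.array([1.0, -2.0, 3.0])
zeros = np.zeros((3, 3))
y = residual_block(x, zeros, zeros)
```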
II. Text Classification SOTA Models (12)
1. Model: RAE
Paper: Semi-Supervised Recursive Autoencoders for Predicting Sentiment Distributions
2. Model: DAN
Paper: Deep Unordered Composition Rivals Syntactic Methods for Text Classification
3. Model: TextRCNN
Paper: Recurrent Convolutional Neural Networks for Text Classification
4. Model: Multi-task
Paper: Recurrent Neural Network for Text Classification with Multi-Task Learning
5. Model: DeepMoji
Paper: Using Millions of Emoji Occurrences to Learn Any-Domain Representations for Detecting Sentiment, Emotion and Sarcasm
6. Model: RNN-Capsule
Paper: Sentiment Analysis by Capsules
7. Model: TextCNN
Paper: Convolutional Neural Networks for Sentence Classification
8. Model: DCNN
Paper: A Convolutional Neural Network for Modelling Sentences
9. Model: XML-CNN
Paper: Deep Learning for Extreme Multi-label Text Classification
10. Model: TextCapsule
Paper: Investigating Capsule Networks with Dynamic Routing for Text Classification
11. Model: Bao et al.
Paper: Few-shot Text Classification with Distributional Signatures
12. Model: AttentionXML
Paper: AttentionXML: Label Tree-based Attention-Aware Deep Model for High-Performance Extreme Multi-Label Text Classification
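Several entries above (TextCNN, DCNN, XML-CNN) classify text by convolving over token embeddings and then max-pooling over time. A minimal single-filter sketch in NumPy; the embedding and kernel values are made up for illustration:

```python
import numpy as np

def textcnn_feature(embeddings, kernel):
    """One TextCNN filter: slide a window over token embeddings,
    then max-over-time pooling (Kim-2014 style; minimal sketch).

    embeddings: (seq_len, emb_dim); kernel: (window, emb_dim).
    Returns a single scalar feature for this filter.
    """
    window = kernel.shape[0]
    seq_len = embeddings.shape[0]
    acts = [
        np.sum(embeddings[i:i + window] * kernel)  # window dot-product
        for i in range(seq_len - window + 1)
    ]
    return max(acts)  # max-over-time pooling

# Toy 4-token sentence with 2-d embeddings, one 2-token filter.
emb = np.array([[1.0, 0.0], [0.0, 1.0], [2.0, 2.0], [0.0, 0.0]])
k = np.ones((2, 2))
feat = textcnn_feature(emb, k)
```

In the real model, many such filters of several widths run in parallel and their pooled features feed a softmax classifier.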
III. Text Summarization SOTA Models (17)
1. Model: CopyNet
Paper: Incorporating Copying Mechanism in Sequence-to-Sequence Learning
2. Model: SummaRuNNer
Paper: SummaRuNNer: A Recurrent Neural Network Based Sequence Model for Extractive Summarization of Documents
3. Model: SeqGAN
Paper: SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient
4. Model: Latent Extractive
Paper: Neural Latent Extractive Document Summarization
5. Model: NEUSUM
Paper: Neural Document Summarization by Jointly Learning to Score and Select Sentences
6. Model: BERTSUM
Paper: Text Summarization with Pretrained Encoders
7. Model: BRIO
Paper: BRIO: Bringing Order to Abstractive Summarization
8. Model: NAM
Paper: A Neural Attention Model for Abstractive Sentence Summarization
9. Model: RAS
Paper: Abstractive Sentence Summarization with Attentive Recurrent Neural Networks
10. Model: PGN
Paper: Get To The Point: Summarization with Pointer-Generator Networks
11. Model: Re3Sum
Paper: Retrieve, Rerank and Rewrite: Soft Template Based Neural Summarization
12. Model: MTLSum
Paper: Soft Layer-Specific Multi-Task Summarization with Entailment and Question Generation
13. Model: KGSum
Paper: Mind The Facts: Knowledge-Boosted Coherent Abstractive Text Summarization
14. Model: PEGASUS
Paper: PEGASUS: Pre-training with Extracted Gap-sentences for Abstractive Summarization
15. Model: FASum
Paper: Enhancing Factual Consistency of Abstractive Summarization
16. Model: RNN(ext) + ABS + RL + Rerank
Paper: Fast Abstractive Summarization with Reinforce-Selected Sentence Rewriting
17. Model: BottleSUM
Paper: BottleSum: Unsupervised and Self-supervised Sentence Summarization using the Information Bottleneck Principle
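The pointer-generator network (PGN) above mixes the decoder's vocabulary distribution with a copy distribution derived from attention over the source. A minimal sketch of that final mixture; the probabilities are made up for illustration:

```python
import numpy as np

def pointer_generator_dist(p_vocab, attention, src_ids, p_gen):
    """Final distribution of a pointer-generator decoder step:
    p(w) = p_gen * P_vocab(w) + (1 - p_gen) * sum of attention mass
    on source positions where w occurs (sketch of See et al.)."""
    final = p_gen * np.asarray(p_vocab, dtype=float)
    for attn, tok in zip(attention, src_ids):
        final[tok] += (1.0 - p_gen) * attn  # copy mass from the source
    return final

p_vocab = np.array([0.5, 0.3, 0.2, 0.0])  # generator softmax over vocab
attention = np.array([0.7, 0.3])          # attention over 2 source tokens
src_ids = [3, 1]                          # those tokens' vocabulary ids
dist = pointer_generator_dist(p_vocab, attention, src_ids, p_gen=0.8)
```

Note how vocabulary id 3, unreachable for the generator (P_vocab = 0), still receives probability via copying; this is how PGN handles out-of-vocabulary source words.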
IV. Image Generation SOTA Models (16)
- Progressive Growing of GANs for Improved Quality, Stability, and Variation
- A Style-Based Generator Architecture for Generative Adversarial Networks
- Analyzing and Improving the Image Quality of StyleGAN
- Alias-Free Generative Adversarial Networks
- Very Deep VAEs Generalize Autoregressive Models and Can Outperform Them on Images
- A Contrastive Learning Approach for Training Variational Autoencoder Priors
- StyleGAN-XL: Scaling StyleGAN to Large Diverse Datasets
- Diffusion-GAN: Training GANs with Diffusion
- Improved Training of Wasserstein GANs
- Self-Attention Generative Adversarial Networks
- Large Scale GAN Training for High Fidelity Natural Image Synthesis
- CSGAN: Cyclic-Synthesized Generative Adversarial Networks for Image-to-Image Transformation
- LOGAN: Latent Optimisation for Generative Adversarial Networks
- A U-Net Based Discriminator for Generative Adversarial Networks
- Instance-Conditioned GAN
- Conditional GANs with Auxiliary Discriminative Classifier
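Most of these models train with a variant of the adversarial objective: the discriminator maximizes log D(x) + log(1 - D(G(z))), while the generator (in the common non-saturating form) minimizes -log D(G(z)). A sketch of both losses on given discriminator outputs:

```python
import numpy as np

def d_loss(d_real, d_fake):
    """Discriminator loss of the original GAN, written as a quantity
    to minimize: -E[log D(x)] - E[log(1 - D(G(z)))]."""
    return -np.mean(np.log(d_real)) - np.mean(np.log(1.0 - d_fake))

def g_loss(d_fake):
    """Non-saturating generator loss: -E[log D(G(z))]."""
    return -np.mean(np.log(d_fake))

# A perfectly confused discriminator outputs 0.5 everywhere, giving the
# well-known equilibrium value 2*log(2) for the discriminator loss.
half = np.array([0.5, 0.5])
ld = d_loss(half, half)
lg = g_loss(half)
```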
V. Video Generation SOTA Models (15)
- Temporal Generative Adversarial Nets with Singular Value Clipping
- Generating Videos with Scene Dynamics
- MoCoGAN: Decomposing Motion and Content for Video Generation
- Stochastic Video Generation with a Learned Prior
- Video-to-Video Synthesis
- Probabilistic Video Generation using Holistic Attribute Control
- Adversarial Video Generation on Complex Datasets
- Sliced Wasserstein Generative Models
- Train Sparsely, Generate Densely: Memory-efficient Unsupervised Training of High-resolution Temporal GAN
- Latent Neural Differential Equations for Video Generation
- VideoGPT: Video Generation using VQ-VAE and Transformers
- Diverse Video Generation using a Gaussian Process Trigger
- NÜWA: Visual Synthesis Pre-training for Neural visUal World creAtion
- StyleGAN-V: A Continuous Video Generator with the Price, Image Quality and Perks of StyleGAN2
- Video Diffusion Models
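The last entry, Video Diffusion Models, rests on the closed-form Gaussian forward process: a clean sample can be noised to any step t in one shot via the cumulative schedule alpha_bar_t. A sketch:

```python
import numpy as np

def diffuse(x0, alpha_bar, eps):
    """Closed-form forward step of a Gaussian diffusion process:
    x_t = sqrt(alpha_bar_t) * x_0 + sqrt(1 - alpha_bar_t) * eps,
    with eps ~ N(0, I). The values below are made up for illustration."""
    return np.sqrt(alpha_bar) * x0 + np.sqrt(1.0 - alpha_bar) * eps

x0 = np.array([1.0, -1.0])   # toy "clean" sample
eps = np.array([0.5, 0.5])   # toy noise draw
xt = diffuse(x0, alpha_bar=0.25, eps=eps)
```

The model is then trained to predict eps from x_t, and sampling runs the process in reverse.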
VI. Reinforcement Learning SOTA Models (13)
- Playing Atari with Deep Reinforcement Learning
- Deep Reinforcement Learning with Double Q-learning
- Continuous control with deep reinforcement learning
- Asynchronous Methods for Deep Reinforcement Learning
- Proximal Policy Optimization Algorithms
- Hindsight Experience Replay
- Emergence of Locomotion Behaviours in Rich Environments
- Implicit Quantile Networks for Distributional Reinforcement Learning
- Imagination-Augmented Agents for Deep Reinforcement Learning
- Neural Network Dynamics for Model-Based Deep Reinforcement Learning with Model-Free Fine-Tuning
- Model-based value estimation for efficient model-free reinforcement learning
- Model-ensemble trust-region policy optimization
- Dynamic Horizon Value Estimation for Model-based Reinforcement Learning
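The value-based methods above (DQN and Double DQN) differ only in how the bootstrap target is formed. A sketch of both targets, on made-up Q-values:

```python
import numpy as np

def dqn_target(reward, gamma, q_target_next):
    """Vanilla DQN target: r + gamma * max_a Q_target(s', a)."""
    return reward + gamma * np.max(q_target_next)

def double_dqn_target(reward, gamma, q_online_next, q_target_next):
    """Double DQN: select the action with the online network,
    evaluate it with the target network (reduces overestimation)."""
    a = int(np.argmax(q_online_next))
    return reward + gamma * q_target_next[a]

q_online = np.array([1.0, 3.0])  # online net's Q(s', .)
q_target = np.array([2.0, 0.5])  # target net's Q(s', .)
y_dqn = dqn_target(1.0, 0.9, q_target)                     # 1 + 0.9 * 2.0
y_ddqn = double_dqn_target(1.0, 0.9, q_online, q_target)   # 1 + 0.9 * 0.5
```

When the two networks disagree, Double DQN's target is smaller, which is exactly the overestimation bias it was designed to curb.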
VII. Speech Synthesis SOTA Models (19)
- TTS Synthesis with Bidirectional LSTM based Recurrent Neural Networks
- WaveNet: A Generative Model for Raw Audio
- SampleRNN: An Unconditional End-to-End Neural Audio Generation Model
- Char2Wav: End-to-end speech synthesis
- Deep Voice: Real-time Neural Text-to-Speech
- Parallel WaveNet: Fast High-Fidelity Speech Synthesis
- Statistical Parametric Speech Synthesis Using Generative Adversarial Networks Under A Multi-task Learning Framework
- Tacotron: Towards End-to-End Speech Synthesis
- VoiceLoop: Voice Fitting and Synthesis via a Phonological Loop
- Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions
- Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis
- Deep Voice 3: Scaling text-to-speech with convolutional sequence learning
- ClariNet: Parallel Wave Generation in End-to-End Text-to-Speech
- LPCNet: Improving Neural Speech Synthesis Through Linear Prediction
- Neural Speech Synthesis with Transformer Network
- Glow-TTS: A Generative Flow for Text-to-Speech via Monotonic Alignment Search
- Flow-TTS: A Non-Autoregressive Network for Text to Speech Based on Flow
- Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
- PnG BERT: Augmented BERT on Phonemes and Graphemes for Neural TTS
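WaveNet models raw audio as a categorical distribution after mu-law companding, which compresses amplitudes before 8-bit quantization. A sketch of the encoding:

```python
import numpy as np

def mu_law_encode(x, mu=255):
    """Mu-law companding used by WaveNet before quantization:
    f(x) = sign(x) * ln(1 + mu*|x|) / ln(1 + mu), for x in [-1, 1].
    Output stays in [-1, 1] but low amplitudes get more resolution."""
    x = np.asarray(x, dtype=float)
    return np.sign(x) * np.log1p(mu * np.abs(x)) / np.log1p(mu)

y = mu_law_encode([0.0, 1.0, -1.0])
```

After companding, the signal is quantized to 256 levels and the network predicts one of those levels per sample.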
VIII. Machine Translation SOTA Models (18)
- Neural machine translation by jointly learning to align and translate
- Multi-task Learning for Multiple Language Translation
- Effective Approaches to Attention-based Neural Machine Translation
- A Convolutional Encoder Model for Neural Machine Translation
- Attention is All You Need
- Decoding with Value Networks for Neural Machine Translation
- Unsupervised Neural Machine Translation
- Phrase-based & Neural Unsupervised Machine Translation
- Addressing the Under-translation Problem from the Entropy Perspective
- Modeling Coherence for Discourse Neural Machine Translation
- Cross-lingual Language Model Pretraining
- MASS: Masked Sequence to Sequence Pre-training for Language Generation
- FlowSeq: Non-Autoregressive Conditional Sequence Generation with Generative Flow
- Multilingual Denoising Pre-training for Neural Machine Translation
- Incorporating BERT into Neural Machine Translation
- Pre-training Multilingual Neural Machine Translation by Leveraging Alignment Information
- Contrastive Learning for Many-to-many Multilingual Neural Machine Translation
- Universal Conditional Masked Language Pre-training for Neural Machine Translation
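The Transformer ("Attention is All You Need"), which most later entries here build on, is driven by scaled dot-product attention, softmax(Q K^T / sqrt(d_k)) V. A minimal NumPy sketch:

```python
import numpy as np

def softmax(z):
    z = z - z.max(axis=-1, keepdims=True)  # numerical stability
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def attention(Q, K, V):
    """Scaled dot-product attention: softmax(Q K^T / sqrt(d_k)) V."""
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)
    return softmax(scores) @ V

# One query attending over two keys; the values are made up.
Q = np.array([[1.0, 0.0]])
K = np.array([[1.0, 0.0], [0.0, 1.0]])
V = np.array([[10.0, 0.0], [0.0, 10.0]])
out = attention(Q, K, V)
```

Since the query aligns with the first key, the output is a convex mix of the value rows weighted toward the first one.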
IX. Text Generation SOTA Models (10)
- Sequence to sequence learning with neural networks
- Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation
- Neural machine translation by jointly learning to align and translate
- SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient
- Attention is all you need
- Improving language understanding by generative pre-training
- BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
- Cross-lingual Language Model Pretraining
- Language Models are Unsupervised Multitask Learners
- BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension
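All of the sequence models above generate text autoregressively; the simplest decoding strategy is greedy search, which appends the argmax token at each step. A sketch with a toy stand-in for the trained model:

```python
import numpy as np

def greedy_decode(logits_fn, start_id, eos_id, max_len=10):
    """Greedy autoregressive decoding: feed the prefix to the model,
    append the argmax token, stop at EOS (minimal sketch;
    logits_fn stands in for a trained seq2seq model or LM)."""
    seq = [start_id]
    for _ in range(max_len):
        nxt = int(np.argmax(logits_fn(seq)))
        seq.append(nxt)
        if nxt == eos_id:
            break
    return seq

def toy_model(seq):
    # Toy stand-in: always prefers token (last + 1) mod 4; id 3 is EOS.
    return np.eye(4)[(seq[-1] + 1) % 4]

out = greedy_decode(toy_model, start_id=0, eos_id=3)
```

Beam search and sampling replace the argmax step but keep the same token-by-token loop.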
X. Speech Recognition SOTA Models (11)
- A Neural Probabilistic Language Model
- Recurrent neural network based language model
- LSTM neural networks for language modeling
- Hybrid speech recognition with deep bidirectional LSTM
- Attention is all you need
- Improving language understanding by generative pre-training
- BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
- Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context
- Feedforward sequential memory networks: A new structure to learn long-term dependency
- Convolutional, long short-term memory, fully connected deep neural networks
- Highway long short-term memory RNNs for distant speech recognition
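The language models in this section are conventionally compared by perplexity, the exponential of the average negative log-likelihood per token. A sketch:

```python
import numpy as np

def perplexity(token_probs):
    """Perplexity of a language model on a sequence:
    exp(-mean log p(w_i | context)); lower is better."""
    logp = np.log(np.asarray(token_probs, dtype=float))
    return float(np.exp(-logp.mean()))

# A model that assigns probability 1/4 to every token over a 4-word
# vocabulary has perplexity exactly 4.
ppl = perplexity([0.25, 0.25, 0.25, 0.25])
```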
XI. Object Detection SOTA Models (16)
- Rich feature hierarchies for accurate object detection and semantic segmentation
- Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition
- Fast R-CNN
- Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
- Training Region-based Object Detectors with Online Hard Example Mining
- R-FCN: Object Detection via Region-based Fully Convolutional Networks
- Mask R-CNN
- You Only Look Once: Unified, Real-Time Object Detection
- SSD: Single Shot Multibox Detector
- Feature Pyramid Networks for Object Detection
- Focal Loss for Dense Object Detection
- Accurate Single Stage Detector Using Recurrent Rolling Convolution
- CornerNet: Detecting Objects as Paired Keypoints
- M2Det: A Single-Shot Object Detector based on Multi-Level Feature Pyramid Network
- Fully Convolutional One-Stage Object Detection
- ObjectBox: From Centers to Boxes for Anchor-Free Object Detection
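Every detector above matches predicted boxes to ground truth via Intersection-over-Union (IoU). A minimal implementation for axis-aligned boxes:

```python
def iou(box_a, box_b):
    """Intersection-over-Union of two axis-aligned boxes given as
    (x1, y1, x2, y2); the core matching/evaluation metric in the
    detectors listed above."""
    x1 = max(box_a[0], box_b[0])
    y1 = max(box_a[1], box_b[1])
    x2 = min(box_a[2], box_b[2])
    y2 = min(box_a[3], box_b[3])
    inter = max(0.0, x2 - x1) * max(0.0, y2 - y1)  # 0 if no overlap
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    return inter / (area_a + area_b - inter)

# Two 2x2 boxes overlapping in a 1x1 region: IoU = 1 / (4 + 4 - 1).
v = iou((0, 0, 2, 2), (1, 1, 3, 3))
```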
XII. Recommender System SOTA Models (18)
- Learning Deep Structured Semantic Models for Web Search using Clickthrough Data
- Deep Neural Networks for YouTube Recommendations
- Self-Attentive Sequential Recommendation
- Graph Convolutional Neural Networks for Web-Scale Recommender Systems
- Learning Tree-based Deep Model for Recommender Systems
- Multi-Interest Network with Dynamic Routing for Recommendation at Tmall
- PinnerSage: Multi-Modal User Embedding Framework for Recommendations at Pinterest
- Efficient Non-Sampling Factorization Machines for Optimal Context-Aware Recommendation
- Self-Supervised Multi-Channel Hypergraph Convolutional Network for Social Recommendation
- Field-aware Factorization Machines for CTR Prediction
- Deep Learning over Multi-field Categorical Data – A Case Study on User Response Prediction
- Product-based Neural Networks for User Response Prediction
- Wide & Deep Learning for Recommender Systems
- Deep & Cross Network for Ad Click Predictions
- xDeepFM: Combining Explicit and Implicit Feature Interactions for Recommender Systems
- Deep Interest Network for Click-Through Rate Prediction
- GateNet: Gating-Enhanced Deep Network for Click-Through Rate Prediction
- Package Recommendation with Intra- and Inter-Package Attention Networks
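The factorization-machine family above (FFM, xDeepFM, the non-sampling FM) scores pairwise feature interactions through shared latent factors; Rendle's identity computes the second-order term in O(kn) instead of enumerating all pairs. A sketch:

```python
import numpy as np

def fm_pairwise(x, V):
    """Second-order term of a Factorization Machine via Rendle's
    identity: 1/2 * sum_f [ (sum_i v_if x_i)^2 - sum_i v_if^2 x_i^2 ].
    x: (n,) feature vector; V: (n, k) latent factor matrix."""
    x = np.asarray(x, dtype=float)
    s = (V.T @ x) ** 2            # (sum_i v_if x_i)^2, per factor f
    s2 = (V.T ** 2) @ (x ** 2)    # sum_i v_if^2 x_i^2, per factor f
    return 0.5 * float(np.sum(s - s2))

# Two active features, one latent factor each (values made up):
# the term reduces to <v_1, v_2> * x_1 * x_2 = 1 * 2 = 2.
x = np.array([1.0, 1.0])
V = np.array([[1.0], [2.0]])
score = fm_pairwise(x, V)
```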
XIII. Super-Resolution SOTA Models (16)
- Image Super-Resolution Using Deep Convolutional Networks
- Deeply-Recursive Convolutional Network for Image Super-Resolution
- Accelerating the Super-Resolution Convolutional Neural Network
- Real-Time Single Image and Video Super-Resolution Using an Efficient Sub-Pixel Convolutional Neural Network
- Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network
- Image Restoration Using Convolutional Auto-encoders with Symmetric Skip Connections
- Accurate Image Super-Resolution Using Very Deep Convolutional Networks
- Image super-resolution via deep recursive residual network
- Deep Laplacian Pyramid Networks for Fast and Accurate Super-Resolution
- Image Super-Resolution Using Very Deep Residual Channel Attention Networks
- Image Super-Resolution via Dual-State Recurrent Networks
- Recovering Realistic Texture in Image Super-resolution by Deep Spatial Feature Transform
- Cascade Convolutional Neural Network for Image Super-Resolution
- Image Super-Resolution with Cross-Scale Non-Local Attention and Exhaustive Self-Exemplars Mining
- Single Image Super-Resolution via a Holistic Attention Network
- One-to-many Approach for Improving Super-Resolution
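The efficient sub-pixel network (ESPCN, "Real-Time Single Image and Video Super-Resolution..." above) upscales by predicting r^2 channels per output channel and rearranging them spatially (pixel shuffle). A NumPy sketch of the rearrangement:

```python
import numpy as np

def pixel_shuffle(x, r):
    """Sub-pixel rearrangement: (C*r^2, H, W) -> (C, H*r, W*r).
    Each group of r^2 channels becomes an r x r block of pixels."""
    c_r2, h, w = x.shape
    c = c_r2 // (r * r)
    out = x.reshape(c, r, r, h, w)      # split channels into (r, r)
    out = out.transpose(0, 3, 1, 4, 2)  # interleave: (c, h, r, w, r)
    return out.reshape(c, h * r, w * r)

# Four 1x1 feature maps become one 2x2 image for upscale factor r = 2.
x = np.arange(4, dtype=float).reshape(4, 1, 1)
y = pixel_shuffle(x, 2)
```

Doing all computation at low resolution and upsampling only at the end is what makes ESPCN real-time.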