Model Summaries
Last updated
Last updated
The model architectures included come from a wide variety of sources. Sources, including papers, original impl (“reference code”) that I rewrote / adapted, and PyTorch impl that I leveraged directly (“code”) are listed below.
Most included models have pretrained weights. The weights are either:
from their original sources
ported by myself from their original impl in a different framework (e.g. Tensorflow models)
trained from scratch using the included training script
The validation results for the pretrained weights are
A more exciting view (with pretty pictures) of the models within timm
can be found at .
Implementation:
Paper: Big Transfer (BiT): General Visual Representation Learning
-
Reference code:
Implementation:
Paper: CSPNet: A New Backbone that can Enhance Learning Capability of CNN
-
Reference impl:
Papers:
ResNet (V1B)
ResNeXt
‘Bag of Tricks’ / Gluon C, D, E, S ResNet variants
Instagram pretrained / ImageNet tuned ResNeXt101
Semi-supervised (SSL) / Semi-weakly Supervised (SWSL) ResNet and ResNeXts
Squeeze-and-Excitation Networks
Code: Added to ResNet base, this is current version going forward, old senet.py
is being deprecated
ECAResNet (ECA-Net)
Implementation:
Paper: Densely Connected Convolutional Networks
-
Code:
Implementation:
Paper:
Code:
Implementation:
Paper: Dual Path Networks
-
My PyTorch code:
Reference code:
Implementation:
Paper: Neural Architecture Design for GPU-Efficient Networks
-
Reference code:
Implementation:
Paper: Deep High-Resolution Representation Learning for Visual Recognition
-
Code:
Implementation:
Paper: Rethinking the Inception Architecture for Computer Vision
-
Code:
Implementation:
Paper: Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning
-
Code:
Reference code:
Implementation:
Paper: Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning
-
Code:
Reference code:
Implementation:
Papers: Learning Transferable Architectures for Scalable Image Recognition
-
Code:
Reference code:
Implementation:
Papers: Progressive Neural Architecture Search
-
Code:
Reference code:
Implementation:
EfficientNet NoisyStudent (B0-B7, L2) -
EfficientNet AdvProp (B0-B8) -
EfficientNet (B0-B7) -
EfficientNet-EdgeTPU (S, M, L) -
MixNet -
MNASNet B1, A1 (Squeeze-Excite), and Small -
MobileNet-V2 -
FBNet-C -
Single-Path NAS -
My PyTorch code:
Reference code:
Implementation:
Paper: Searching for MobileNetV3
-
Reference code:
Implementation:
Paper: Designing Network Design Spaces
-
Reference code:
Implementation:
Paper: Making VGG-style ConvNets Great Again
-
Reference code:
Implementation:
Paper: Deep Residual Learning for Image Recognition
-
Code:
Paper: Aggregated Residual Transformations for Deep Neural Networks
-
Code:
Paper: Bag of Tricks for Image Classification with CNNs
-
Code:
Paper: Exploring the Limits of Weakly Supervised Pretraining
-
Weights: (NOTE: CC BY-NC 4.0 License, NOT commercial friendly)
Paper: Billion-scale semi-supervised learning for image classification
-
Weights: (NOTE: CC BY-NC 4.0 License, NOT commercial friendly)
Paper: Squeeze-and-Excitation Networks
-
Paper: ECA-Net: Efficient Channel Attention for Deep CNN
-
Code: Added to ResNet base, ECA module contributed by @VRandme, reference
Implementation:
Paper: Res2Net: A New Multi-scale Backbone Architecture
-
Code:
Implementation:
Paper: ResNeSt: Split-Attention Networks
-
Code:
Implementation:
Paper: ReXNet: Diminishing Representational Bottleneck on CNN
-
Code:
Implementation:
Paper: Selective-Kernel Networks
-
Code: ,
Implementation:
Paper: XNect: Real-time Multi-Person 3D Motion Capture with a Single RGB Camera
-
Code:
Implementation: NOTE: I am deprecating this version of the networks, the new ones are part of resnet.py
Paper: Squeeze-and-Excitation Networks
-
Code:
Implementation:
Paper: TResNet: High Performance GPU-Dedicated Architecture
-
Code:
Implementation:
Paper: Very Deep Convolutional Networks For Large-Scale Image Recognition
-
Reference code:
Implementation:
Paper: An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
-
Reference code and pretrained weights:
Implementation:
Paper: CenterMask : Real-Time Anchor-Free Instance Segmentation
-
Reference code:
Implementation:
Paper: Xception: Deep Learning with Depthwise Separable Convolutions
-
Code:
Implementation:
Paper: Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation
-
Reference code: ,
Implementation:
Paper: Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation
-
Reference code: