Getting Started
Installation
Docker
Verification
Using Composer
Functional API
MosaicML Trainer
Welcome Tour
Our First Method!
A Simple Instrumented Trainer
Introducing… Events, Engines, and State
Next: The MosaicML Trainer
Custom Models and Datasets
Models
Datasets
Trainer init
Trainer with YAHP
Distributed Training (DDP)
Single-Node Example
Detailed Usage
core
composer.Algorithm
composer.Callback
composer.Engine
Trace
composer.Event
Events List
API Reference
composer.Logger
composer.State
composer.core.surgery
composer.types
Tensor Types
Batch Types
Dataset and Data Loader Types
Trainer Types
Miscellaneous Types
composer
composer.algorithms
Alibi
Augmix
BlurPool
Channels Last
ColOut
CutOut
Ghost Batch Normalization
Label Smoothing
Layer Freezing
MixUp
Progressive Resizing
RandAugment
Sequence Length Warmup
Sharpness-Aware Minimization
Scaling the Learning Rate Schedule
Selective Backpropagation
Squeeze-and-Excitation
Stochastic Depth
Stochastic Weight Averaging
composer.callbacks
Callbacks
Callback Hyperparameters
composer.datasets
Base Classes and Hyperparameters
Datasets
composer.functional
composer.loggers
Backends
Backend Hyperparameters
composer.models
Base Models
Image Models
Language Models
Metrics and Loss Functions
composer.optim
DecoupledSGDW
DecoupledAdamW
composer.optim.scheduler
WarmUpLR
ConstantLR
ComposedScheduler
composer.trainer
Examples
Trainer Hparams
API Reference
composer.trainer.devices
Devices
Device Dataloaders
Methods Library
ALiBi
TL;DR
Attribution
Applicable Settings
Hyperparameters
Example Effects
Implementation Details
Suggested Hyperparameters
Considerations
Composability
Code
AugMix
TL;DR
Attribution
Applicable Settings
Hyperparameters
Example Effects
Implementation Details
Suggested Hyperparameters
Considerations
Composability
Code
BlurPool
TL;DR
Attribution
Code and Hyperparameters
Applicable Settings
Example Effects
Implementation Details
Suggested Hyperparameters
Considerations
Composability
Acknowledgments
Code
Channels Last
TL;DR
Attribution
Applicable Settings
Hyperparameters
Example Effects
Implementation Details
Considerations
Composability
Code
ColOut
TL;DR
Attribution
Applicable Settings
Hyperparameters
Example Effects
Implementation Details
Suggested Hyperparameters
Considerations
Composability
Code
Cutout
TL;DR
Attribution
Applicable Settings
Hyperparameters
Example Effects
Implementation Details
Suggested Hyperparameters
Considerations
Composability
Code
Decoupled Weight Decay
TL;DR
Attribution
Code and Hyperparameters
Applicable Settings
Implementation Details
Considerations
Composability
Ghost BatchNorm
TL;DR
Attribution
Hyperparameters
Applicable Settings
Example Effects
Implementation Details
Suggested Hyperparameters
Considerations
Composability
Code
Label Smoothing
TL;DR
Attribution
Applicable Settings
Hyperparameters
Example Effects
Implementation Details
Suggested Hyperparameters
Considerations
Composability
Code
Layer Freezing
TL;DR
Attribution
Hyperparameters
Applicable Settings
Example Effects
Implementation Details
Suggested Hyperparameters
Considerations
Composability
Code
MixUp
TL;DR
Attribution
Hyperparameters
Example Effects
Implementation Details
Suggested Hyperparameters
Considerations
Composability
Code
Progressive Image Resizing
TL;DR
Attribution
Applicable Settings
Hyperparameters
Example Effects
Implementation Details
Suggested Hyperparameters
Considerations
Composability
Detailed Results
Code
RandAugment
TL;DR
Attribution
Applicable Settings
Hyperparameters
Example Effects
Implementation Details
Suggested Hyperparameters
Considerations
Composability
Code
Scale Schedule
TL;DR
Attribution
Hyperparameters
Example Effects
Implementation Details
Suggested Hyperparameters
Composability
Code
Scaling Laws
TL;DR
Attribution
Applicable Settings
Hyperparameters
Detailed Results
Considerations
Effects & Implications
Selective Backprop
TL;DR
Attribution
Applicable Settings
Hyperparameters
Example Effects
Implementation Details
Suggested Hyperparameters
Considerations
Composability
Detailed Results
Code
Sequence Length Warmup
TL;DR
Hyperparameters
Applicable Settings
Effects
Implementation Details
Suggested Hyperparameters
Considerations
Composition
Attribution
Code
Sharpness Aware Minimization
TL;DR
Attribution
Code and Hyperparameters
Applicable Settings
Example Effects
Implementation Details
Suggested Hyperparameters
Considerations
Composability
Code
Squeeze-and-Excitation
TL;DR
Attribution
Code and Hyperparameters
Applicable Settings
Example Effects
Implementation Details
Suggested Hyperparameters
Considerations
Composability
Code
Stochastic Depth (Block-Wise)
TL;DR
Attribution
Applicable Settings
Hyperparameters
Example Effects
Implementation Details
Suggested Hyperparameters
Considerations
Composability
Code
Stochastic Depth (Sample-Wise)
TL;DR
Attribution
Hyperparameters
Applicable Settings
Example Effects
Implementation Details
Suggested Hyperparameters
Considerations
Composability
Code
Stochastic Weight Averaging
TL;DR
Attribution
Applicable Settings
Hyperparameters
Example Effects
Implementation Details
Considerations
Composability
Model Library
CIFAR ResNet
Overview
Attribution
Architecture
Family members
Default Training Hyperparameters
EfficientNet
Overview
Attribution
Architecture
Family members
Default Training Hyperparameters
GPT-2
Overview
Attribution
Architecture
Family Members
Implementation Details
Exploring Tradeoffs Between Quality and Training Speed / Cost
ImageNet ResNet
Overview
Attribution
Architecture
Family Members
Implementation details
Default Training Hyperparameters
UNet
Overview
Attribution
Architecture
Implementation Details
Exploring Tradeoffs Between Quality and Training Speed/Cost
MosaicML
»
composer.optim
Edit on GitHub
composer.optim
DecoupledSGDW
DecoupledAdamW
composer.optim.scheduler
WarmUpLR
ConstantLR
ComposedScheduler
Read the Docs
v: v0.3.0
Versions
latest
stable
v0.3.1
v0.3.0
v0.2.4
v0.2.3
Downloads
On Read the Docs
Project Home
Builds