Training Optimization: Memory Efficiency

Memory efficiency techniques for training: Gradient Checkpointing and Mixed Precision Training.