
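- Split the training data into mini-batches
- Compute a gradient for each sample, i.e. how that sample alone would modify the weights
- Clip each per-sample gradient to a maximum norm, so no single sample can dominate the update
- Average the clipped gradients so the update reflects the whole mini-batch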
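- Add noise scaled to the clipping threshold and the size of the mini-batch

Usually patterns emerge from characteristics shared between many people, so DP works: the noise masks any one person's contribution, while the shared patterns survive the averaging.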
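
The steps listed above can be sketched in plain Python. This is a minimal illustration under assumptions that are not in the notes: a linear model with squared-error loss, Gaussian noise, and a noise standard deviation of `noise_multiplier * clip_norm / batch size`. All function and parameter names are hypothetical, and the sketch does no privacy accounting, so it does not by itself tell you what privacy guarantee you get.

```python
import math
import random


def per_sample_gradient(w, x, y):
    """Gradient of 0.5 * (w.x - y)^2 with respect to w, for one sample."""
    err = sum(wi * xi for wi, xi in zip(w, x)) - y
    return [err * xi for xi in x]


def clip(grad, clip_norm):
    """Scale grad down so its L2 norm is at most clip_norm."""
    norm = math.sqrt(sum(g * g for g in grad))
    scale = min(1.0, clip_norm / norm) if norm > 0 else 1.0
    return [g * scale for g in grad]


def dp_sgd_step(w, batch, lr, clip_norm, noise_multiplier, rng):
    """One noisy update on a mini-batch of (x, y) pairs."""
    # Per-sample gradients, each clipped on its own.
    clipped = [clip(per_sample_gradient(w, x, y), clip_norm) for x, y in batch]
    n = len(batch)
    # Average the clipped gradients over the mini-batch.
    avg = [sum(g[i] for g in clipped) / n for i in range(len(w))]
    # Assumed noise scale: proportional to the clipping threshold,
    # shrinking as the mini-batch grows.
    sigma = noise_multiplier * clip_norm / n
    noisy = [a + rng.gauss(0.0, sigma) for a in avg]
    return [wi - lr * gi for wi, gi in zip(w, noisy)]


def train(data, dim, epochs, batch_size, lr, clip_norm, noise_multiplier, seed=0):
    """Shuffle, split into mini-batches, and apply the noisy update to each."""
    rng = random.Random(seed)
    w = [0.0] * dim
    for _ in range(epochs):
        shuffled = data[:]
        rng.shuffle(shuffled)
        for start in range(0, len(shuffled), batch_size):
            batch = shuffled[start:start + batch_size]
            w = dp_sgd_step(w, batch, lr, clip_norm, noise_multiplier, rng)
    return w
```

Setting `noise_multiplier=0.0` turns the noise off, which is handy for checking the clipping and averaging logic in isolation before looking at the noisy behaviour.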