Handwritten Character Classification Pipeline
Timeframe: Oct 2024 – Dec 2024
Stack: TensorFlow · Keras · TFRecords
Overview
Built an EMNIST classification pipeline using TFRecords to parse, pad 28×28 to 32×32, correct rotations, then shuffled, batched (128), and prefetched training samples to minimize memory footprint.
Approach
- Fine-tuned ResNet50V2-based classifier with dropout and batch normalization, unfreezing the top 30% of layers
- Employed mixed bfloat16 precision, dynamic learning-rate reduction, and checkpointing
- Optimized training efficiency by reducing required batches from ~4,171 to 1,500 through strategic shuffling and batching, without sacrificing accuracy
Results
- Achieved 93.55% test accuracy (0.1745 loss)
- Leveraged mixed precision plus multi-core parallelism for faster convergence
