CIFAR-10 image classification#

The base CNN is composed of three convolutional layers with 32, 64, and 128 kernels, each followed by a max-pooling layer that reduces the spatial dimensions of the feature maps, and ends with two fully connected layers of 128 and 10 units, respectively. A tuned variant with batch normalization is trained afterwards for comparison.

Parameters are also tuned:

- Increasing the number of epochs generally increases accuracy and lowers loss.
- Increasing the number of layers tends to increase accuracy.
- Increasing the number of neurons per layer also tends to increase accuracy.


Importing libraries and packages#

# System
import os

# Mathematical operations and data manipulation
import numpy as np

# Modelling
import tensorflow as tf
import keras
from tensorflow.keras.datasets import cifar10
from tensorflow.keras.preprocessing.image import ImageDataGenerator
from tensorflow.keras import layers

# Plotting
import matplotlib.pyplot as plt

%matplotlib inline

print("Tensorflow version:", tf.__version__)
print("Keras version:", keras.__version__)
Tensorflow version: 2.4.1
Keras version: 2.4.3
# Suppress TensorFlow C++ INFO and WARNING messages (usually set before
# importing TensorFlow so that import-time messages are silenced as well)
os.environ["TF_CPP_MIN_LOG_LEVEL"] = "2"

Loading dataset#

The CIFAR-10 dataset (Canadian Institute for Advanced Research) is composed of 60,000 32×32 colour images divided evenly among 10 classes: airplanes, automobiles, birds, cats, deer, dogs, frogs, horses, ships, and trucks.

# Loading the dataset's train and test data
(X_train, Y_train), (X_test, Y_test) = cifar10.load_data()
print("X_train Shape: {}".format(X_train.shape))
print("X_test Shape: {}".format(X_test.shape))
print("Y_train Shape: {}".format(Y_train.shape))
print("Y_test Shape: {}".format(Y_test.shape))
X_train Shape: (50000, 32, 32, 3)
X_test Shape: (10000, 32, 32, 3)
Y_train Shape: (50000, 1)
Y_test Shape: (10000, 1)
# Flattening the label arrays from shape (n, 1) to (n,)
Y_train = Y_train.reshape(-1)
Y_test = Y_test.reshape(-1)
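The flattened labels are plain integer class indices, which is what both the plotting helper below and the sparse_categorical_crossentropy loss expect. A quick sanity check (a minimal sketch, not part of the original notebook):

# Labels are now 1-D arrays of integer class indices
print(Y_train.shape, Y_test.shape)  # expected: (50000,) (10000,)
print(Y_train[:5])  # first five class indices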

Exploring dataset#

classes = [
    "airplane",
    "automobile",
    "bird",
    "cat",
    "deer",
    "dog",
    "frog",
    "horse",
    "ship",
    "truck",
]
def plot_sample(x, y, index):
    # Display one image with its class name on the x-axis
    plt.figure(figsize=(15, 2))
    plt.imshow(x[index])
    plt.xlabel(classes[y[index]])


plot_sample(X_train, Y_train, 6)
(Figure: a sample training image shown with its class label.)
for i in range(10):
    plot_sample(X_train, Y_train, i)
(Figure: the first ten training images, each labelled with its class.)

Data generators#

# Instantiating two ImageDataGenerator objects: pixel values are rescaled
# from 0-255 to 0-1, and data augmentation is applied to the training set only
data_gen_train = ImageDataGenerator(
    rescale=1.0 / 255,
    width_shift_range=0.1,
    height_shift_range=0.1,
    horizontal_flip=True,
)
data_gen_valid = ImageDataGenerator(rescale=1.0 / 255)

batch_size = 16
img_height = 32
img_width = 32

# Create data generators
train_generator = data_gen_train.flow(X_train, Y_train, batch_size=batch_size)
valid_generator = data_gen_valid.flow(X_test, Y_test, batch_size=batch_size)
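To confirm the generators behave as intended, one augmented batch can be inspected directly (a minimal sketch, not part of the original notebook):

# Fetch a single augmented batch and check its shape and value range
x_batch, y_batch = next(train_generator)
print(x_batch.shape)  # (16, 32, 32, 3)
print(x_batch.min(), x_batch.max())  # pixel values now lie in [0, 1]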

Training base model#

# Setting 8 as the seed for numpy and tensorflow
np.random.seed(8)
tf.random.set_seed(8)
# Instantiate a tf.keras.Sequential() class
model = tf.keras.Sequential(
    [
        layers.Conv2D(
            32, 3, activation="relu", input_shape=(img_height, img_width, 3)
        ),
        layers.MaxPooling2D(pool_size=(2, 2)),
        layers.Conv2D(64, 3, activation="relu"),
        layers.MaxPooling2D(pool_size=(2, 2)),
        layers.Dropout(0.5),
        layers.Conv2D(128, 3, activation="relu"),
        layers.MaxPooling2D(pool_size=(2, 2)),
        layers.Dropout(0.5),
        layers.Flatten(),
        layers.Dense(128, activation="relu"),
        layers.Dense(10, activation="softmax"),
    ]
)
# Adam optimizer with a learning rate of 0.001
optimizer = tf.keras.optimizers.Adam(0.001)
# Providing training parameters using the compile method
model.compile(
    loss="sparse_categorical_crossentropy",
    optimizer=optimizer,
    metrics=["accuracy"],
)
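sparse_categorical_crossentropy accepts the integer class indices produced by the earlier reshape directly; the one-hot alternative would look like this (an illustrative sketch, not used in this notebook):

# One-hot alternative: convert labels and switch to the non-sparse loss
Y_train_onehot = tf.keras.utils.to_categorical(Y_train, num_classes=10)
model.compile(
    loss="categorical_crossentropy",
    optimizer=optimizer,
    metrics=["accuracy"],
)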
# Inspecting the model configuration using the summary function
model.summary()
Model: "sequential"
_________________________________________________________________
Layer (type)                 Output Shape              Param #   
=================================================================
conv2d (Conv2D)              (None, 30, 30, 32)        896       
_________________________________________________________________
max_pooling2d (MaxPooling2D) (None, 15, 15, 32)        0         
_________________________________________________________________
conv2d_1 (Conv2D)            (None, 13, 13, 64)        18496     
_________________________________________________________________
max_pooling2d_1 (MaxPooling2 (None, 6, 6, 64)          0         
_________________________________________________________________
dropout (Dropout)            (None, 6, 6, 64)          0         
_________________________________________________________________
conv2d_2 (Conv2D)            (None, 4, 4, 128)         73856     
_________________________________________________________________
max_pooling2d_2 (MaxPooling2 (None, 2, 2, 128)         0         
_________________________________________________________________
dropout_1 (Dropout)          (None, 2, 2, 128)         0         
_________________________________________________________________
flatten (Flatten)            (None, 512)               0         
_________________________________________________________________
dense (Dense)                (None, 128)               65664     
_________________________________________________________________
dense_1 (Dense)              (None, 10)                1290      
=================================================================
Total params: 160,202
Trainable params: 160,202
Non-trainable params: 0
_________________________________________________________________
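As a sanity check, the counts in this summary can be reproduced by hand: a Conv2D layer has (kernel height × kernel width × input channels + 1 bias) × filters parameters, so the first layer contributes (3 × 3 × 3 + 1) × 32 = 896 and the second (3 × 3 × 32 + 1) × 64 = 18,496, while the first dense layer has 512 × 128 + 128 = 65,664, all matching the table above.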
# Train the model by calling the fit method
history = model.fit(
    train_generator,
    steps_per_epoch=len(X_train) // batch_size,
    epochs=20,
    validation_data=valid_generator,
    validation_steps=len(X_test) // batch_size,
)
Epoch 1/20
3125/3125 [==============================] - 226s 72ms/step - loss: 1.8943 - accuracy: 0.2881 - val_loss: 1.3835 - val_accuracy: 0.5015
Epoch 2/20
3125/3125 [==============================] - 231s 74ms/step - loss: 1.5147 - accuracy: 0.4474 - val_loss: 1.3152 - val_accuracy: 0.5396
Epoch 3/20
3125/3125 [==============================] - 225s 72ms/step - loss: 1.4232 - accuracy: 0.4845 - val_loss: 1.2126 - val_accuracy: 0.5869
Epoch 4/20
3125/3125 [==============================] - 233s 74ms/step - loss: 1.3581 - accuracy: 0.5071 - val_loss: 1.2017 - val_accuracy: 0.5908
Epoch 5/20
3125/3125 [==============================] - 232s 74ms/step - loss: 1.3336 - accuracy: 0.5223 - val_loss: 1.1047 - val_accuracy: 0.6105
Epoch 6/20
3125/3125 [==============================] - 225s 72ms/step - loss: 1.2782 - accuracy: 0.5466 - val_loss: 1.1320 - val_accuracy: 0.6090
Epoch 7/20
3125/3125 [==============================] - 210s 67ms/step - loss: 1.2562 - accuracy: 0.5509 - val_loss: 1.0624 - val_accuracy: 0.6397
Epoch 8/20
3125/3125 [==============================] - 212s 68ms/step - loss: 1.2436 - accuracy: 0.5578 - val_loss: 1.0339 - val_accuracy: 0.6516
Epoch 9/20
3125/3125 [==============================] - 214s 69ms/step - loss: 1.2178 - accuracy: 0.5694 - val_loss: 1.0233 - val_accuracy: 0.6498
Epoch 10/20
3125/3125 [==============================] - 216s 69ms/step - loss: 1.2072 - accuracy: 0.5698 - val_loss: 1.0720 - val_accuracy: 0.6296
Epoch 11/20
3125/3125 [==============================] - 210s 67ms/step - loss: 1.1815 - accuracy: 0.5811 - val_loss: 0.9665 - val_accuracy: 0.6697
Epoch 12/20
3125/3125 [==============================] - 217s 69ms/step - loss: 1.1762 - accuracy: 0.5846 - val_loss: 0.9531 - val_accuracy: 0.6753
Epoch 13/20
3125/3125 [==============================] - 208s 67ms/step - loss: 1.1564 - accuracy: 0.5912 - val_loss: 0.9342 - val_accuracy: 0.6794
Epoch 14/20
3125/3125 [==============================] - 237s 76ms/step - loss: 1.1562 - accuracy: 0.5945 - val_loss: 0.9413 - val_accuracy: 0.6929
Epoch 15/20
3125/3125 [==============================] - 230s 74ms/step - loss: 1.1439 - accuracy: 0.5976 - val_loss: 0.9294 - val_accuracy: 0.6883
Epoch 16/20
3125/3125 [==============================] - 236s 76ms/step - loss: 1.1401 - accuracy: 0.5958 - val_loss: 0.9286 - val_accuracy: 0.6898
Epoch 17/20
3125/3125 [==============================] - 224s 72ms/step - loss: 1.1248 - accuracy: 0.6043 - val_loss: 0.9105 - val_accuracy: 0.6912
Epoch 18/20
3125/3125 [==============================] - 224s 72ms/step - loss: 1.1183 - accuracy: 0.6038 - val_loss: 0.8786 - val_accuracy: 0.7110
Epoch 19/20
3125/3125 [==============================] - 215s 69ms/step - loss: 1.1168 - accuracy: 0.6086 - val_loss: 0.8851 - val_accuracy: 0.7119
Epoch 20/20
3125/3125 [==============================] - 210s 67ms/step - loss: 1.1166 - accuracy: 0.6069 - val_loss: 0.9053 - val_accuracy: 0.6867
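The epoch count of 20 was fixed in advance, and the validation loss is still trending downwards at the end of the run; training could instead be stopped automatically once it plateaus. A hedged sketch using Keras's EarlyStopping callback (not part of the original run):

# Stop once validation loss has not improved for 3 epochs, and restore
# the weights from the best epoch seen
early_stop = tf.keras.callbacks.EarlyStopping(
    monitor="val_loss", patience=3, restore_best_weights=True
)
history = model.fit(
    train_generator,
    steps_per_epoch=len(X_train) // batch_size,
    epochs=50,  # generous upper bound; EarlyStopping decides the actual length
    validation_data=valid_generator,
    validation_steps=len(X_test) // batch_size,
    callbacks=[early_stop],
)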

Base model statistics#

# Test the model by calling the evaluate() function
# Note: X_test still holds raw 0-255 pixel values, while the model was trained
# on inputs rescaled to 0-1 by the generators; this mismatch explains the poor
# loss and accuracy reported below
evaluation = model.evaluate(X_test, Y_test)
313/313 [==============================] - 3s 8ms/step - loss: 168.2624 - accuracy: 0.3406
[168.26243591308594, 0.34060001373291016]
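A fairer evaluation feeds the model inputs on the same 0-1 scale it was trained on, either by rescaling manually or by evaluating through the validation generator (a minimal sketch, assuming the objects defined above):

# Rescale the test images so the input distribution matches training
evaluation = model.evaluate(X_test / 255.0, Y_test)

# Equivalently, evaluate through the rescaling validation generator
evaluation = model.evaluate(valid_generator, steps=len(X_test) // batch_size)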
def plotloss(history):
    # Plot training loss, validation loss, and training accuracy per epoch
    plt.plot(history.history["loss"])
    plt.plot(history.history["val_loss"])
    plt.plot(history.history["accuracy"])
    plt.xlabel("Epochs")
    plt.ylabel("Loss / Accuracy")
    plt.legend(["Train loss", "Validation loss", "Train accuracy"])
    plt.show()


plotloss(history)
(Figure: training loss, validation loss, and training accuracy of the base model over the 20 epochs.)

Tuned model with batch normalization#

model_b = tf.keras.Sequential(
    [
        # Block 1: two 64-filter convolutions, each batch-normalized;
        # input_shape is only needed on the first layer
        layers.Conv2D(
            64,
            (4, 4),
            activation="relu",
            input_shape=(img_height, img_width, 3),
            padding="same",
        ),
        layers.BatchNormalization(),
        layers.Conv2D(64, (4, 4), activation="relu", padding="same"),
        layers.BatchNormalization(),
        layers.MaxPooling2D(pool_size=(2, 2)),
        layers.Dropout(0.2),
        # Block 2: two 128-filter convolutions
        layers.Conv2D(128, (4, 4), activation="relu", padding="same"),
        layers.BatchNormalization(),
        layers.Conv2D(128, (4, 4), activation="relu", padding="same"),
        layers.BatchNormalization(),
        layers.MaxPooling2D(pool_size=(2, 2)),
        layers.Dropout(0.25),
        # Block 3: two more 128-filter convolutions
        layers.Conv2D(128, (4, 4), activation="relu", padding="same"),
        layers.BatchNormalization(),
        layers.Conv2D(128, (4, 4), activation="relu", padding="same"),
        layers.BatchNormalization(),
        layers.MaxPooling2D(pool_size=(2, 2)),
        layers.Dropout(0.35),
        # Classifier head (265 units as in the original run, kept to match
        # the reported parameter counts)
        layers.Flatten(),
        layers.Dense(265, activation="relu"),
        layers.BatchNormalization(),
        layers.Dropout(0.5),
        layers.Dense(10, activation="softmax"),
    ]
)
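The repeated convolution / batch-normalization pattern can be factored into a small helper; the following sketch builds an equivalent architecture (the conv_block helper and model_b_alt name are illustrative, not from the original):

def conv_block(filters, dropout):
    # Two same-padded 4x4 convolutions, each batch-normalized,
    # followed by max pooling and dropout
    return [
        layers.Conv2D(filters, (4, 4), activation="relu", padding="same"),
        layers.BatchNormalization(),
        layers.Conv2D(filters, (4, 4), activation="relu", padding="same"),
        layers.BatchNormalization(),
        layers.MaxPooling2D(pool_size=(2, 2)),
        layers.Dropout(dropout),
    ]


model_b_alt = tf.keras.Sequential(
    [layers.InputLayer(input_shape=(img_height, img_width, 3))]
    + conv_block(64, 0.2)
    + conv_block(128, 0.25)
    + conv_block(128, 0.35)
    + [
        layers.Flatten(),
        layers.Dense(265, activation="relu"),
        layers.BatchNormalization(),
        layers.Dropout(0.5),
        layers.Dense(10, activation="softmax"),
    ]
)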
# Compile with the same loss and metrics; the Adam optimizer instance from
# the base model is reused here
model_b.compile(
    loss="sparse_categorical_crossentropy",
    optimizer=optimizer,
    metrics=["accuracy"],
)
# Inspecting the model configuration using the summary function
model_b.summary()
Model: "sequential_1"
_________________________________________________________________
Layer (type)                 Output Shape              Param #   
=================================================================
conv2d_3 (Conv2D)            (None, 32, 32, 64)        3136      
_________________________________________________________________
batch_normalization (BatchNo (None, 32, 32, 64)        256       
_________________________________________________________________
conv2d_4 (Conv2D)            (None, 32, 32, 64)        65600     
_________________________________________________________________
batch_normalization_1 (Batch (None, 32, 32, 64)        256       
_________________________________________________________________
max_pooling2d_3 (MaxPooling2 (None, 16, 16, 64)        0         
_________________________________________________________________
dropout_2 (Dropout)          (None, 16, 16, 64)        0         
_________________________________________________________________
conv2d_5 (Conv2D)            (None, 16, 16, 128)       131200    
_________________________________________________________________
batch_normalization_2 (Batch (None, 16, 16, 128)       512       
_________________________________________________________________
conv2d_6 (Conv2D)            (None, 16, 16, 128)       262272    
_________________________________________________________________
batch_normalization_3 (Batch (None, 16, 16, 128)       512       
_________________________________________________________________
max_pooling2d_4 (MaxPooling2 (None, 8, 8, 128)         0         
_________________________________________________________________
dropout_3 (Dropout)          (None, 8, 8, 128)         0         
_________________________________________________________________
conv2d_7 (Conv2D)            (None, 8, 8, 128)         262272    
_________________________________________________________________
batch_normalization_4 (Batch (None, 8, 8, 128)         512       
_________________________________________________________________
conv2d_8 (Conv2D)            (None, 8, 8, 128)         262272    
_________________________________________________________________
batch_normalization_5 (Batch (None, 8, 8, 128)         512       
_________________________________________________________________
max_pooling2d_5 (MaxPooling2 (None, 4, 4, 128)         0         
_________________________________________________________________
dropout_4 (Dropout)          (None, 4, 4, 128)         0         
_________________________________________________________________
flatten_1 (Flatten)          (None, 2048)              0         
_________________________________________________________________
dense_2 (Dense)              (None, 265)               542985    
_________________________________________________________________
batch_normalization_6 (Batch (None, 265)               1060      
_________________________________________________________________
dropout_5 (Dropout)          (None, 265)               0         
_________________________________________________________________
dense_3 (Dense)              (None, 10)                2660      
=================================================================
Total params: 1,536,017
Trainable params: 1,534,207
Non-trainable params: 1,810
_________________________________________________________________
# Train the model by calling the fit method
history_b = model_b.fit(
    train_generator,
    steps_per_epoch=len(X_train) // batch_size,
    epochs=10,
    validation_data=valid_generator,
    validation_steps=len(X_test) // batch_size,
)
Epoch 1/10
3125/3125 [==============================] - 818s 261ms/step - loss: 2.0562 - accuracy: 0.2889 - val_loss: 1.5065 - val_accuracy: 0.4551
Epoch 2/10
3125/3125 [==============================] - 853s 273ms/step - loss: 1.3733 - accuracy: 0.5116 - val_loss: 1.0796 - val_accuracy: 0.6221
Epoch 3/10
3125/3125 [==============================] - 866s 277ms/step - loss: 1.1067 - accuracy: 0.6141 - val_loss: 0.8818 - val_accuracy: 0.6986
Epoch 4/10
3125/3125 [==============================] - 787s 252ms/step - loss: 0.9181 - accuracy: 0.6862 - val_loss: 0.7880 - val_accuracy: 0.7316
Epoch 5/10
3125/3125 [==============================] - 725s 232ms/step - loss: 0.8071 - accuracy: 0.7275 - val_loss: 0.7703 - val_accuracy: 0.7391
Epoch 6/10
3125/3125 [==============================] - 746s 239ms/step - loss: 0.7414 - accuracy: 0.7481 - val_loss: 0.5918 - val_accuracy: 0.8030
Epoch 7/10
3125/3125 [==============================] - 810s 259ms/step - loss: 0.6739 - accuracy: 0.7743 - val_loss: 0.6155 - val_accuracy: 0.7929
Epoch 8/10
3125/3125 [==============================] - 828s 265ms/step - loss: 0.6351 - accuracy: 0.7880 - val_loss: 0.5448 - val_accuracy: 0.8157
Epoch 9/10
3125/3125 [==============================] - 866s 277ms/step - loss: 0.6016 - accuracy: 0.7979 - val_loss: 0.5519 - val_accuracy: 0.8145
Epoch 10/10
3125/3125 [==============================] - 843s 270ms/step - loss: 0.5680 - accuracy: 0.8099 - val_loss: 0.4912 - val_accuracy: 0.8332

Tuned model statistics#

plotloss(history_b)
(Figure: training loss, validation loss, and training accuracy of the tuned model over the 10 epochs.)

Saving#

# Save the tuned model (SavedModel format), including the optimizer state
tf.keras.models.save_model(
    model_b,
    "./assets/cifar_tuned.model",
    overwrite=True,
    include_optimizer=True,
)
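The saved model can later be restored for inference without rebuilding the architecture (a minimal sketch, assuming the path used above):

# Reload the saved model and verify it still scores the rescaled test set
restored = tf.keras.models.load_model("./assets/cifar_tuned.model")
restored.evaluate(X_test / 255.0, Y_test)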