Introduction to Keras

Keras_Logo

Keras is a Python deep-learning framework authored by François Chollet that provides a convenient way to define and train deep-learning models.

Or simply:
Keras: an API for specifying & training differentiable programs [1]

It is the official high-level API of TensorFlow, available as

tf.keras
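
For instance, the same API is reachable from inside TensorFlow itself. A minimal sketch (assuming a TensorFlow build that bundles tf.keras; the layer size and input shape are illustrative choices):

In [ ]:
import tensorflow as tf

# build and compile a one-layer classifier through tf.keras
# rather than standalone keras
model = tf.keras.models.Sequential([
    tf.keras.layers.Dense(10, activation="softmax", input_shape=(784,))
])
model.compile(optimizer="rmsprop", loss="categorical_crossentropy")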

Keras has the following key features:

  1. It inherits the strengths of its backend ML library (no need to worry about low-level details such as Eager execution).
  2. It has a user-friendly API that makes it easy to quickly prototype deep-learning models.
  3. It has built-in support for various neural network architectures:
    • convolutional networks (for computer vision)
    • recurrent networks (for sequence processing)
    • and any combination of both (see the sketch after this list)
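
To illustrate that last point, here is a minimal sketch of a model combining a convolutional front-end with a recurrent layer (the layer sizes and the 10-frame 28x28 input shape are illustrative assumptions, e.g. for classifying short image sequences):

In [ ]:
from keras import models, layers

model = models.Sequential()
# TimeDistributed applies the same convolution to every frame of the sequence
model.add(layers.TimeDistributed(layers.Conv2D(16, (3, 3), activation='relu'),
                                 input_shape=(10, 28, 28, 1)))
model.add(layers.TimeDistributed(layers.Flatten()))
# the recurrent layer then reads the per-frame features in order
model.add(layers.LSTM(32))
model.add(layers.Dense(10, activation='softmax')) # 10-way classifier
model.summary()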

Interest in Keras [2]

stats

Who or what makes up Keras?

Contributors
Keras does not execute low-level operations itself.
Instead, it relies on a specialized, well-optimized tensor library, e.g. TensorFlow, to handle operations such as tensor manipulation and differentiation. This library serves as the backend engine of Keras.
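
As a minimal sketch of this delegation (assuming the multi-backend Keras used in this notebook), the backend-agnostic calls below run unchanged whether the engine is TensorFlow, CNTK, or Theano:

In [ ]:
from keras import backend as K
import numpy as np

a = K.constant(np.array([[1.0, 2.0], [3.0, 4.0]]))
b = K.constant(np.array([[1.0, 0.0], [0.0, 1.0]]))
c = K.dot(a, b)  # symbolic matrix product, executed by whichever backend is active
print(K.eval(c)) # evaluate the result into a NumPy array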

Keras handles the problem in a modular way.

Backend

In [1]:
"""
importing modules
"""
from keras import backend as BackEnd # the module that allows us to manipulate our Keras backend
import os # library that will give us system access to the keras backend file
from importlib import reload # the library that will we will use reload a function
Using TensorFlow backend.

Switching to CNTK[3] as our backend.

In [2]:
"""
# setting our backend to CNTK 
i.e Microsoft Cognitive toolkit
"""
os.environ['KERAS_BACKEND'] = "cntk" 
reload(BackEnd)
Using CNTK backend
Out[2]:
<module 'keras.backend' from 'c:\\intelpython3\\lib\\site-packages\\keras\\backend\\__init__.py'>

Switching to Theano[4] as our backend.

In [3]:
"""
# setting our backend to Theano
developed by the MILA lab at Université de Montréal,
"""
os.environ['KERAS_BACKEND'] = "theano"
reload(BackEnd)
Using Theano backend.
Out[3]:
<module 'keras.backend' from 'c:\\intelpython3\\lib\\site-packages\\keras\\backend\\__init__.py'>
In [4]:
"""
Let's make life easier by making a function;
this function will let us just use one line to change out backend
"""
def set_keras_backend(backend):# set_keras_backend will be the name of our function
                               # the argument we will take in will be our backend framework
    if BackEnd.backend() != backend: 
        os.environ['KERAS_BACKEND'] = backend
        reload(BackEnd)
        assert BackEnd.backend() == backend
In [5]:
set_keras_backend("tensorflow")
Using TensorFlow backend.

Scaling on the job market

Keras Users

Who is using Keras?

Companies

Distributed Computing

Distributed

How to use Keras

Keras is a simple but powerful tool for users of every experience level.

API

The Sequential API

In [ ]:
import keras
from keras import layers
In [ ]:
model = keras.Sequential()
In [ ]:
model.add(layers.Dense(20, activation="relu", input_shape=(10,))) # first hidden layer; expects 10 input features
model.add(layers.Dense(20, activation="relu"))
model.add(layers.Dense(10, activation="softmax")) # 10-way probability output
In [ ]:
# (assumes the model has been compiled and that x, y hold the training data)
model.fit(x, y, epochs=10, batch_size=32)

The Functional API

In [ ]:
import keras
from keras import layers
In [ ]:
inputs = keras.Input(shape=(10,))
In [ ]:
x = layers.Dense(20, activation="relu")(inputs)
x = layers.Dense(20, activation="relu")(x)
In [ ]:
outputs = layers.Dense(10, activation="softmax")(x)
In [ ]:
model = keras.Model(inputs, outputs)
In [ ]:
# (assumes the model has been compiled; here x, y stand for actual training arrays)
model.fit(x, y, epochs=10, batch_size=32)

Model subclassing

In [ ]:
import keras
from keras import layers
In [ ]:
class MyModel(keras.Model):
    def __init__(self):
        super(MyModel, self).__init__()
        self.dense1 = layers.Dense(20, activation="relu")
        self.dense2 = layers.Dense(20, activation="relu")
        self.dense3 = layers.Dense(10, activation="softmax")

    def call(self, inputs):
        x = self.dense1(inputs)
        x = self.dense2(x)
        return self.dense3(x)

model = MyModel()
# (assumes the model has been compiled and that x, y hold the training data)
model.fit(x, y, epochs=10, batch_size=32)

MNIST

Mnist

In [6]:
# mnist comes with keras as numpy arrays
from keras.datasets import mnist
In [7]:
# loading the training and test set images along with the labels
(train_images, train_labels), (test_images, test_labels) = mnist.load_data()
In [8]:
# to see the number of training images and dimensions
train_images.shape
Out[8]:
(60000, 28, 28)
In [9]:
# viewing the number of labels "training"
len(train_labels)
Out[9]:
60000
In [10]:
# to view the content and dtype of the training labels
train_labels
Out[10]:
array([5, 0, 4, ..., 5, 6, 8], dtype=uint8)

We move on to the test data now.

In [11]:
# to see the number of testing images and dimensions
test_images.shape
Out[11]:
(10000, 28, 28)
In [12]:
# viewing the number of labels "testing"
len(test_labels)
Out[12]:
10000
In [13]:
# to view the content and dtype of the testing labels
test_labels
Out[13]:
array([7, 2, 1, ..., 4, 5, 6], dtype=uint8)

About the network architecture

In [14]:
from keras import models
from keras import layers
In [15]:
network = models.Sequential()

# a fully connected (Dense) layer of 512 neurons, taking the flattened 28 * 28 image as input
network.add(layers.Dense(512, activation='relu', input_shape=(28 * 28,)))

# the output layer returns an array of 10 probability scores (summing to 1);
# each score is the probability that the current digit image belongs to
# one of our 10 digit classes
network.add(layers.Dense(10, activation='softmax'))

To make the network ready for training, we need to pick three more things as part of the compilation step:

-> A loss function—how the network will be able to measure its performance on the training data, and thus how it will be able to steer itself in the right direction.

-> An optimizer—the mechanism through which the network will update itself based on the data it sees and its loss function.

-> Metrics to monitor during training and testing—here, we'll only care about accuracy (the fraction of the images that were correctly classified).

The compilation step

In [16]:
network.compile(optimizer='rmsprop',
                loss='categorical_crossentropy',
                metrics=['accuracy'])

Before training, we’ll preprocess the data by reshaping it into the shape the network expects and scaling it so that all values are in the [0, 1] interval.

Previously, our training images, for instance, were stored in an array of shape (60000, 28, 28) of type uint8 with values in the [0, 255] interval.

In [17]:
# We transform it into a float32 array of
# shape (60000, 28 * 28) with values between 0 and 1.

train_images = train_images.reshape((60000, 28 * 28))
train_images = train_images.astype('float32') / 255

# dividing by 255 is a form of data normalisation
test_images = test_images.reshape((10000, 28 * 28))
test_images = test_images.astype('float32') / 255
In [18]:
# Preparing the labels

from keras.utils import to_categorical
# we need to categorically encode (one-hot) the labels

train_labels = to_categorical(train_labels)
test_labels = to_categorical(test_labels)
In [19]:
# We’re now ready to train the network, which in Keras is done via a call to
# the network’s fit method—we fit the model to its training data

network.fit(train_images, train_labels, batch_size=128, epochs=10, verbose=1, validation_split=0.1)
Train on 54000 samples, validate on 6000 samples
Epoch 1/10
54000/54000 [==============================] - 4s 73us/step - loss: 0.2777 - acc: 0.9193 - val_loss: 0.1233 - val_acc: 0.9630
Epoch 2/10
54000/54000 [==============================] - 4s 73us/step - loss: 0.1136 - acc: 0.9666 - val_loss: 0.0958 - val_acc: 0.9733
Epoch 3/10
54000/54000 [==============================] - 4s 75us/step - loss: 0.0748 - acc: 0.9777 - val_loss: 0.0780 - val_acc: 0.9788
Epoch 4/10
54000/54000 [==============================] - 4s 76us/step - loss: 0.0534 - acc: 0.9838 - val_loss: 0.0851 - val_acc: 0.9732
Epoch 5/10
54000/54000 [==============================] - 4s 81us/step - loss: 0.0397 - acc: 0.9882 - val_loss: 0.0647 - val_acc: 0.9815
Epoch 6/10
54000/54000 [==============================] - 4s 76us/step - loss: 0.0305 - acc: 0.9905 - val_loss: 0.0585 - val_acc: 0.9832
Epoch 7/10
54000/54000 [==============================] - 4s 79us/step - loss: 0.0234 - acc: 0.9930 - val_loss: 0.0611 - val_acc: 0.9837
Epoch 8/10
54000/54000 [==============================] - 5s 84us/step - loss: 0.0175 - acc: 0.9949 - val_loss: 0.0704 - val_acc: 0.9825
Epoch 9/10
54000/54000 [==============================] - 5s 84us/step - loss: 0.0130 - acc: 0.9965 - val_loss: 0.0714 - val_acc: 0.9823
Epoch 10/10
54000/54000 [==============================] - 5s 89us/step - loss: 0.0106 - acc: 0.9967 - val_loss: 0.0712 - val_acc: 0.9827
Out[19]:
<keras.callbacks.History at 0x175106e63c8>
In [20]:
# Now let’s check that the model performs well on the test set
test_loss, test_acc = network.evaluate(test_images, test_labels)
print('test_acc:', test_acc)
10000/10000 [==============================] - 1s 50us/step
test_acc: 0.9821

Thank you

Shout out to our sponsor

Deep Analytics

Visit their website

Like their Facebook page