Difference between revisions of "Distributed Tensorflow in Kubernetes"

Revision as of 01:41, 16 November 2018

Distributed Tensorflow (Clustering) can speed up your training. Distributed tensorflow in kubernates make it easy to:

This topic will describe how to make a distributed tensorflow.

You must know the basic concept of distributed tensorflow here: Distributed TensorFlow
You must know how to write a distributed tensorflow training. Ex: train_and_evaluate

1. Create(Download) source & Dockerfile (File:Iris train and eval.zip) and unzip to the same folder.

2. Create training container, where "ecgwc" is the username in dockerhub and "tf-iris:dist" is the container name

$ docker build -t ecgwc/tf-iris:dist .

3. Check if trainig docker is workable.

$ docker run --rm ecgwc/tf-iris:dist

4. Push docker to dockerHub

$ docker push ecgwc/tf-iris:dist

5. Create(Download) yaml file for distributed tensorflow: File:Tf-dist-iris.zip

6. Deploy yaml to k8s

$ kubectl create -f tf-dist-iris.yaml

7. Check training process

@@ Line 15: / Line 15: @@
 == Steps ==
-. Create(Download) source & Dockerfile [[File:Iris train and eval.zip|RTENOTITLE]] and unzip to the same folder.
+. Create(Download) source & Dockerfile ([[File:Iris train and eval.zip|RTENOTITLE]]) and unzip to the same folder.
 . Create training container, where "ecgwc" is the username in dockerhub and "tf-iris:dist" is the container name
 <syntaxhighlight lang="bash">
 $ docker build -t ecgwc/tf-iris:dist .
-</syntaxhighlight>
+</syntaxhighlight>3. Check if trainig docker is workable.<syntaxhighlight lang="bash">
-. Check if trainig docker is workable.
-<syntaxhighlight lang="bash">
 $ docker run --rm ecgwc/tf-iris:dist
-</syntaxhighlight>[[File:Dist tf k8s-1.png|RTENOTITLE]]
+</syntaxhighlight>[[File:Dist tf k8s-1.png|RTENOTITLE]] 4. Push docker to dockerHub<syntaxhighlight lang="bash">
-. Push docker to dockerHub
-<syntaxhighlight lang="bash">
 $ docker push ecgwc/tf-iris:dist
-</syntaxhighlight>
+</syntaxhighlight> 5. Create(Download) yaml file for distributed tensorflow: [[File:Tf-dist-iris.zip|RTENOTITLE]]
-. Create(Download) yaml file for distributed tensorflow: [[File:Tf-dist-iris.zip|RTENOTITLE]]
 . Deploy yaml to k8s
 <syntaxhighlight lang="bash">
 $ kubectl create -f tf-dist-iris.yaml
-</syntaxhighlight>
+</syntaxhighlight> 7. Check training process
-. Check training process
 [[File:Dist tf k8s-2.png|RTENOTITLE]]