资源算法kubeflow-extensions

kubeflow-extensions

2020-04-10 | |  43 |   0 |   0

Kubeflow Pipeline Templates

This is a repo for the Agile Stacks Kubeflow Pipelines templates. It contains end-to-end tutorials for Kubeflow Pipelines, Keras, and Seldon.

The following Kubeflow extensions are introduced to Kubeflow to address common challenges:

  1. get_secret for managing Kubernets secrets from Jupyter notebooks

  2. S3 filesystem for Kubeflow

  3. Kaniko KFP component for building Docker images

  4. Template magic for Notebooks

  5. Extensions for Notebooks: debugging, variable explorer

  6. Environment configuration for Notebooks based on configmaps and secrets

  7. Continuous deployment for Kubeflow Pipelines

Each template introduces a machine learning project structure that allows to modularize data processing, model definition, model training, validation, and inference tasks.  Using distinct steps makes it possible to rerun only the steps you need, as you tweak and test your workflow. A well-defined, standard project structure helps all team members to understand how a model was created.

In this tutorial we will introduce reusable Machine Learning Pipelines that can be used as a template for your machine learning scenarios.

We will cover the following steps:

  • Learn how to build pipelines for training, monitoring and deployment of deep learning models.

  • Prepare and store training data in S3 buckets or NFS volumes.

  • Build, train, optimize, and deploy models from Jupyter notebooks.

  • Train Sequence to Sequence NLP model using multiple GPUs.

  • Deploy your machine learning models and experiments at scale on AWS Kubernetes service

  • Reduce cost with automation, node autoscaling, and AWS spot instances.

  • Deploy machine learning models on AWS, GCP, or on-prem hardware.

  • Compare results of experiments with monitoring and experiment tracking tools.

  • Automate experiment tracking, hyper parameter optimization, model versioning, deployment to production.

  • Implement simple web applications and send prediction requests to the deployed models using Seldon or Tensorflow Serving.

Sample pipelines, algorithms, and training data sets are provided for solving common problems such as data cleansing and training on a large amount of samples.

Repository layout

tutorials/ - various tutorials and documentation related to the pipeline templates
kernels/ - here we publish Jupyter Notebook custom kernels
applications/ - application templates used to as tutorials and for user application generation by SuperHub
storage/ - Contains storage plugins such as Goofys Flex plugin for S3
nbextensions/ - Jupyter notebook extensions

Automated Deployment Template on Agile Stacks

Pipeline templates can be deployed on AWS and and on-prem Kubernetes clusters. Support for GCP and Azure is coming soon. For step-by-step guide on using this repository please see ML Pipelines Templates tutorial on Agile Stacks site.

Manual Deployment

In this section, you will deploy the tutorial manually on existing Kubernetes cluster. Stay tuned for updates.

Acknowledgments

We will use popular open source frameworks such as Kubeflow, Keras, and Seldon to implement end-to-end ML pipelines that can run on AWS, GCP, or on-premises hardware. The Kubeflow project is designed to simplify the deployment of machine learning projects like Keras and TensorFlow on Kubernetes. These frameworks can leverage multiple GPUs in the Kubernetes cluster to scale machine learning tasks. We will also cover how you can use Kubeflow pipelines to continuously deploy models to production and retrain models on real life data. By deploying the ML model into real world you will learn from feedback loops and quickly improve the model. After implementing this tutorial, you will gain knowledge about best practices for deploying ML algorithms and artifacts, and learn about open source software available for production ML lifecycle management.


License

This repositoriy is distributed under Apache 2 (license text)


上一篇: tfx-kubeflow-pipelines

下一篇:kubeflow-broadmission

用户评价
全部评价

热门资源

  • seetafaceJNI

    项目介绍 基于中科院seetaface2进行封装的JAVA...

  • spark-corenlp

    This package wraps Stanford CoreNLP annotators ...

  • Keras-ResNeXt

    Keras ResNeXt Implementation of ResNeXt models...

  • capsnet-with-caps...

    CapsNet with capsule-wise convolution Project ...

  • shih-styletransfer

    shih-styletransfer Code from Style Transfer ...