资源算法gpt-2-flask-api

gpt-2-flask-api

2020-03-02 | |  64 |   0 |   0

GPT-2 Flask API

Containerising PyTorch models in a repeatable way. Deploy OpenAI's GPT-2 model and expose it over a Flask API. Finally deploy it to AWS Fargate container hosting using CloudFormation.

图片.png

First, before anything else download the model

mkdir models
curl --output models/gpt2-pytorch_model.bin https://s3.amazonaws.com/models.huggingface.co/bert/gpt2-pytorch_model.bin

Local


Run the following to get started with your local python environment

python3 -m venv ./venvsource venv/bin/activate
pip install --upgrade pip
pip install -r requirements.txt

Then run the python flask server using the following

cd deployment
python run_server.py

docker-compose

Setup

docker-compose up --build flask

Go to http://localhost:5000

Shutdown

docker-compose down -v

AWS


First build and push the container to ECR

./container_push.sh

Setup the CloudFormation stack

./cloudformation_deploy.sh

Deploy the stack

aws cloudformation create-stack 
    --stack-name "gpt-2-flask" 
    --template-body file://cloudformation/deployment.yaml 
    --parameters file://cloudformation/deployment-params.json 
    --capabilities CAPABILITY_IAM

Attribution



上一篇:gpt-2_bot

下一篇: gpt2-french

用户评价
全部评价

热门资源

  • seetafaceJNI

    项目介绍 基于中科院seetaface2进行封装的JAVA...

  • spark-corenlp

    This package wraps Stanford CoreNLP annotators ...

  • Keras-ResNeXt

    Keras ResNeXt Implementation of ResNeXt models...

  • capsnet-with-caps...

    CapsNet with capsule-wise convolution Project ...

  • inferno-boilerplate

    This is a very basic boilerplate example for pe...