This repository has been archived by the owner on Mar 12, 2020. It is now read-only.

Xennis/dataflow-demo


dataflow-demo

Setup

Local setup

Requirements

  • Python 2.7 is installed
  • Google Cloud SDK is installed

Create a virtual environment and install the dependencies

virtualenv --python python2.7 .venv
. .venv/bin/activate
pip install --requirement requirements.txt

Log in with the Google Cloud SDK

gcloud auth login
gcloud auth application-default login

GCP setup

  • Create a bucket
  • Enable the dataflow.googleapis.com API
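
Both steps can also be done from the command line instead of the Cloud Console. The following is a minimal sketch, assuming `GCP_PROJECT`, `GCP_REGION` and `GCP_BUCKET` are already exported (the same variables the Run section relies on). The function name `setup_gcp` is illustrative and not part of this repository.

```shell
# Hypothetical helper, not part of this repository.
# Assumes GCP_PROJECT, GCP_REGION and GCP_BUCKET are set.
setup_gcp() {
    # Create the bucket that holds input, temp and staging files.
    gsutil mb -p "${GCP_PROJECT}" -l "${GCP_REGION}" "gs://${GCP_BUCKET}"
    # Enable the Dataflow API for the project.
    gcloud services enable dataflow.googleapis.com --project "${GCP_PROJECT}"
}
```

Call `setup_gcp` once after logging in with the Google Cloud SDK.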

Run

cd customer
python customer.py \
    --setup_file ./setup.py \
    --runner DataflowRunner \
    --project ${GCP_PROJECT} \
    --region ${GCP_REGION} \
    --temp_location gs://${GCP_BUCKET}/.tmp/ \
    --staging_location gs://${GCP_BUCKET}/.staging \
    --input gs://${GCP_BUCKET}/customer/customer-001.json
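
The command above expects `GCP_PROJECT`, `GCP_REGION` and `GCP_BUCKET` to be set in the shell. An unset variable would silently expand to an empty string and produce an invalid `gs://` path, so a small check before submitting the job may help. This is a sketch only, and `check_env` is a hypothetical helper that is not part of this repository.

```shell
# Hypothetical helper, not part of this repository.
# Returns non-zero if a variable used by the run command is missing or empty.
check_env() {
    for name in GCP_PROJECT GCP_REGION GCP_BUCKET; do
        # Portable indirect lookup of the variable named in ${name}.
        eval "value=\${${name}:-}"
        if [ -z "${value}" ]; then
            echo "missing environment variable: ${name}" >&2
            return 1
        fi
    done
}
```

Export the three variables with your own project ID, region and bucket name, then run `check_env && python customer.py ...` with the arguments shown above.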

About

Apache Beam (Google Dataflow) demo
