sqlflow_public
SQLFlow is a tool that automates data lineage discovery by analyzing the SQL script. It generates a nice clean diagram to show the dataflow among the table/view and columns in the data warehouse.
Support more than 20 major databases including bigquery, couchbase, dax, db2, greenplum, hana, hive, impala, informix, mdx, mysql, netezza, openedge, oracle, postgresql, redshift, snowflake, sqlserver, sybase, teradata, vertica,
Just paste the SQL script and click a button, you will get the data lineage diagram instantly, highlight the dataflow in the diagram with a simple mouse click.
You can also call the RESTful API provided by this tool in your own program and get the data lineage and diagram model information in a JSON snippet to make further usage.
Send the SQL script received from the browser in JSON to the backend.
After receiving the result which includes the data lineage and diagram model generated by the backend, visualize the diagram model in the browser.
Highlight the dataflow in the diagram when the user clicks on a specific column.
SQLFlow-Analyzer
: receiving the SQL script from the frontend and parse the SQL script into parse tree nodes
by utilizing the GSP library, then calculate the data lineage by analyzing AST.
The SQLFlow-Analyzer
component is open source and can be executed as a standalone tool. Take SQL text
as input and generate data lineage in JSON format. Check SQLFlow.java
for more.
FlowLayout
: Calculating the layout of database objects(table/column) in the dlineage and
generate the diagram model with all necessary position data, including nodes and edges.FlowLayout
depends on doLayout library
to layout the database objects.
Return a JSON snippet including the data lineage and diagram model to the frontend.
Visit gudusoft.com website using browser
You may paste your SQL script into the SQLFlow web page, or upload the SQL file to the site. select the correct database and then click the visualize button.
Use RESTFul APIs
SQLFlow provides RESTful API, so your program can communicate with the SQLFlow backend directly. Sending the SQL to SQLFlow backend and receive a JSON snippet including the data lineage and diagram model for further processing in your own program.
Please put the following authorization code in your request header when calling the RESTful API.
Authorization: token eyJ0eXAiOiJKV1QiLCJhbGciOiJIUzI1NiJ9.eyJhdWQiOiJndWVzdFVzZXIiLCJleHAiOjE1ODEyMDY0MDAsImlhdCI6MTU3MzQzMDQwMH0.-lvxaPlXmHbtgSFgW7ycu8KUczRiFZy5A1aNRGY-tKM
Add visualize feature in your own application/website
The SQLFlow consists of the frontend and backend. You may setup the frontend on you own web server, or include the frontend in your application to add visualize feature.
Your application still need connect to the SQLFlow backend in order to process the SQL code to get data lineage.
Install both frontend and backend on your own application/server
Setup both the frontend and backend of SQLFlow on our server. Please check the setup manual.
In order to setup SQLFlow on your own server, Please contact us (support@gudusoft.com) to obtain a commerical license to get all those distribution files.
Please note that the distribution files doesn’t include the source code of the SQLFlow except the FlowAnalyzer
which is open source here.
This article describes the relationship generated by the SQLFlow between column and column, column and table/view. One relation includes one target column and a relationship type and one or more source columns.
上一篇:fklearn-samples
下一篇:sqlflowserver
还没有评论,说两句吧!
热门资源
Keras-ResNeXt
Keras ResNeXt Implementation of ResNeXt models...
seetafaceJNI
项目介绍 基于中科院seetaface2进行封装的JAVA...
spark-corenlp
This package wraps Stanford CoreNLP annotators ...
capsnet-with-caps...
CapsNet with capsule-wise convolution Project ...
inferno-boilerplate
This is a very basic boilerplate example for pe...
智能在线
400-630-6780
聆听.建议反馈
E-mail: support@tusaishared.com