sgcn_original

SGCN Processing

This repository contains code for processing the data that USGS integrates and synthesizes for the State Species of Greatest Conservation Need. The scripts here start from the repository for original source data collected from the states and stored in ScienceBase. Data are processed into a still experimental online data system we are experimenting with via an API.

Much of the processing in these notebooks is now dependent on modules from the bis and bis2 packages. The bis package can be found under the same usgs-bcb org on GitHub. The bis2 package contains sensitive information and is available elsewhere.

Current working stuff

The following code is mostly working here. Everything else is still under development and may or may not work for you.

Process SGCN repository source files (notebook and separate script) - This code works through the ScienceBase collection for SWAP source files, checks to see if the data are already in the system, reprocesses as configured, and syncs the data in our online GC2 instance with the data from the source files.

Register SGCN Unique Species Names in TIR (notebook and separate script) - This code grabs up all unique species names from the SGCN data that have not been previously registered in the TIR, packages up registration informaiton, and inserts the data into the TIR for further processing.

Transition between GC2 environments

We're currently in a transition on the SGCN data from one instance of the GeoCloud2 (GC2) environment to another. We started experimenting with the mapcentia.com instance because they were good enough to offer it to us for free and not limit what we could work against. We've now set up a new instance as part of our DataDistillery framework on the ESIP Testbed and are starting to transition code to hit that platform instead. For a while, some of the codes in this repo will continue working against the Mapcentia instance, but we'll get them all cut over once we have all of the data spun up on the DataDistillery instance.

As we are working the code toward that new instance, we are making some adjustments based on what capability the new platform affords. That includes changing some table names and property names that might play a slight bit of havoc on anyone who coupled things too tightly to the old structures. For the most part, we're trying to use PostgreSQL views and associated ElasticSearch indexes as a layer of abstraction between the somewhat fluid underlying data management/production environment and the distribution data for web apps and other uses.

Provisional Software Disclaimer

Under USGS Software Release Policy, the software codes here are considered preliminary, not released officially, and posted to this repo for informal sharing among colleagues.

This software is preliminary or provisional and is subject to revision. It is being provided to meet the need for timely best science. The software has not received final approval by the U.S. Geological Survey (USGS). No warranty, expressed or implied, is made by the USGS or the U.S. Government as to the functionality of the software and related material nor shall the fact of release constitute any such warranty. The software is provided on the condition that neither the USGS nor the U.S. Government shall be held liable for any damages resulting from the authorized or unauthorized use of the software.

智能在线

400-630-6780
聆听.建议反馈

E-mail: support@tusaishared.com