Skip to content

Latest commit

 

History

History
12 lines (9 loc) · 5.12 KB

File metadata and controls

12 lines (9 loc) · 5.12 KB
metaLinks
alternates

Data Connectors

Each data connector consists of two Capsules, one which performs a query and outputs a data file (in parquet, text or other tabular format) and one which will create a Data Asset automatically from the results.

LogoTitleDescriptionInput Data
Databricks - Data ConnectorQuery a Databricks metastore to pull data using SQL.
  • None
Databricks - Data Asset GenerationCreate a Data Asset from a SQL query selecting from a Databricks metastore.
  • None
BigQuery - DataThis Capsule makes a connection to Google Cloud's BigQuery, runs a query and downloads the results as a file.
  • None
BigQuery - Data Asset GeneratorA Data Asset generator for Google BigQuery.
  • None
MySQL - Data ConnectorThis Capsule will query a MySQL database and output a file containing the requested data.
  • None
MySQL - Data Asset GenerationThis Capsule will run the MySQL - Data Connector Capsule to query a MySQL database and automatically create a Data Asset from the result.
  • None
Snowflake - Data ConnectorThis Capsule will query a Snowflake database and output a file containing the requested data.
  • None
Snowflake - Data Asset GenerationThis Capsule will run the Snowflake - Data Connector capsule to query a Snowflake database and automatically create a Data Asset from the result.
  • None

AWS Athena (Glue) Data ConnectorThis Capsule will submit a SQL query using Athena and will output a result file for use within Code Ocean.
  • None
AWS Athena (Glue) Data Asset GenerationThis Capsule will perform an AWS Athena query by calling an external Capsule (AWS Glue Athena connector) and generating an output Data Asset.
  • None
Redshift Data ConnectorThis Capsule will query a Redshift database and output a file containing the requested data.
  • None
Redshift Data Connector - Data Asset GenerationThis Capsule will run the Redshift Data Connector capsule to query a Redshift database and automatically create a Data Asset from the result.
  • None
Fetch data with ffqThis Capsule pulls data from GEO, SRA, EMBL-EBI, DDBJ or Biosample by accession number.
  • None
Download data from BaseSpaceThis Capsule downloads demultiplexed (fastq.gz) or raw (bcl) Illumina sequencing data through the Illumina BaseSpace CLI. This Capsule requires a BaseSpace account and NGS data owned or shared with the user.
  • None
Publishing data to TableauThe application is a Streamlit-based interface that simplifies interactions with Tableau. It assists in publishing new data sources to Tableau Cloud from local spreadsheet files.
  • .csv or .tsv files to upload to Tableau.