Skip to content

kbrock91/dbt-formula-1-demo

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

35 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Title: Python Snowpark Formula1

Work in Progress (WIP) Notice

We will be iteratively updating this project for code cleanup, automation, and developing best practices. So far the list of future improvements is as follows:

  • setup folder for connecting to public s3 bucket
  • google drive link to our guide (will be subsequently replaced by dbt ecosystems page)
  • yaml selectors for training and prediction; prediction only
  • codegen for the 8 staging files
  • label encorder clean up for numeric variables
  • ohe for the categorical variables
  • multi-class accuracy
  • trying out https://github.com/omnata-labs/dbt-ml-preprocessing for some of preprocessing?

Project Description

A repo using open source Formula1 to show how dbt cloud combines 1) SQL and python 2) analytics and machine learning (ml). We are able to blend these together seamlessly using Snowpark for python on Snowflake.

How to Run the Project

Placeholder for the guide link. The script to connect to the data is placed in the setup folder.

Credits Placeholder

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%