Skip to content

Kim-Sha/text2sql-sodabot

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Fine-Tuning Language Models for Text2SQL Applications in Open Data (WIP)

See the full model card for sql-sodabot-v1.0 on HuggingFace.

This encoder-decoder model is a descendent of Salesforce/codet5-small, fine-tuned on a modified version of b-mc2/sql-create-context data. The original CodeT5 was published by Salesfoce Research as an "AI-powered coding assistant to boost the productivity of software developers". The goal of this project is to apply transfer learning in order to appropriate this model for text-to-SQL applications, specifically in the context of generating Socrata SQL (SoQL) queries that can be executed on the Socrata Open Data API (e.g., to analyze NYC Open Data).

About

Fine-tuning `Salesforce/codet5` for text-to-SQL applications on Socrata Open Data API

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors