Datasette Data Exploration and Publishing Tool
Datasette is an open-source Python tool for exploring and publishing data. It turns any SQLite database into an interactive web interface with a JSON API, enabling data journalists, researchers, and developers to share datasets without writing application code.
Datasette is an open-source Python tool for exploring and publishing data. It turns any SQLite database into an interactive web interface with a JSON API, enabling data journalists, researchers, and developers to share datasets without writing application code.
npx skills add agentskillexchange/skills --skill datasette-data-exploration-publishing-tool
Datasette is an open-source multi-tool created by Simon Willison for exploring and publishing data. It takes one or more SQLite database files and instantly serves them as an interactive website with a full JSON API. The tool is aimed at data journalists, researchers, archivists, and anyone who needs to make data accessible and queryable without building custom applications.
Running datasette serve database.db starts a local web server that provides a browsable interface for every table, view, and query in the database. Users can filter rows, sort columns, apply facets, and run arbitrary SQL queries through the web interface. Every page also serves as a JSON API endpoint, making it easy to build downstream tools that consume the published data programmatically.
Datasette supports one-command deployment to cloud platforms. datasette publish heroku database.db or datasette publish cloudrun database.db packages the database and application into a Docker container and deploys it, giving you a public URL in minutes. This publish workflow has made Datasette popular in newsrooms and government agencies for releasing open data.
The plugin ecosystem extends Datasette with capabilities like full-text search, map visualizations, chart rendering, authentication, and write APIs. The companion tool sqlite-utils provides a CLI and Python library for creating SQLite databases from CSV, JSON, and other formats, making Datasette part of a broader data pipeline. Datasette also integrates with GitHub Codespaces for cloud-based data exploration.
With over 10,000 GitHub stars, an Apache 2.0 license, and active development since 2017, Datasette has become a foundational tool in the data publishing ecosystem. It is available via pip, Homebrew, and Docker, and requires Python 3.8 or higher.