MetaBrainz Database Dumps

All data dumps are available under Creative Commons licenses.

To keep the MetaBrainz Foundation operating and these datasets maintained and updated, we require financial support from our commercial supporters. Without this support, maintaining these datasets would not be possible. As such, even when a dataset is available under the Creative Commons Zero (CC0) license (public domain), we need the support of commercial supporters.

MusicBrainz PostgreSQL Data Dumps

MusicBrainz data dumps include all public data from our MusicBrainz project. This includes artists, releases, labels, the relationships between them, and much more. In addition, we provide a full history of changes that the MusicBrainz community has made to the data, and the average ratings and tag counts (including genre tags) added by the community.

These data dumps are intended to be imported into a PostgreSQL database; we recommend that you use the musicbrainz-docker project to import them. We do not recommend working with the dump files on their own, or attempting to import the data into a database system other than PostgreSQL, as doing so requires a non-trivial amount of work.

MetaBrainz provides full database exports of the MusicBrainz database twice a week. More frequent (and more convenient) updates are available via the Live Data Feed, which keeps a local mirror in sync with the main MusicBrainz database every hour.

Dataset summary
Documentation: MusicBrainz Database page
Commercial use: Allowed, but financial support strongly urged, even for CC0 data.
Update frequency: Twice a week, Wednesdays and Saturdays.
Licenses: Creative Commons Zero (CC0) for core data; CC Attribution-NonCommercial-ShareAlike 3.0 for supplementary data.
Format: XZ compressed custom PostgreSQL table dumps

MusicBrainz JSON Data Dumps

MetaBrainz provides access to the music metadata in the MusicBrainz database as easily consumable JSON documents. If you cannot work with PostgreSQL, or you prefer to work with a document-oriented data store, then this data dump is for you.

There are individual dump files for each of the following data entities in MusicBrainz: Area, Artist, Event, Instrument, Label, Place, Recording, Release Group, Release, Series and Work. Please note that, to make the dumps easy to import, the data is not normalized and will contain duplicates.

Dataset summary
Documentation: JSON Data Dumps page
Commercial use: Allowed, but financial support strongly urged, even for CC0 data.
Update frequency: Twice a week, Wednesdays and Saturdays.
Licenses: Creative Commons Zero (CC0)
Format: XZ compressed JSONL (one JSON document per line)
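
Because each line of a decompressed JSONL file is a complete JSON document, a dump can be streamed one entity at a time without loading the whole file into memory. The following is a minimal sketch in Python, assuming a single XZ-compressed JSONL file is already at hand. The file name artist_sample.jsonl.xz, the helper iter_documents, and the sample fields "id" and "name" are illustrative placeholders, not taken from the actual dumps; see the JSON Data Dumps page for the real file layout and document schema.

```python
import json
import lzma
import os
import tempfile


def iter_documents(path):
    """Yield one parsed JSON document per non-empty line of an XZ-compressed JSONL file."""
    # lzma.open in text mode decompresses lazily, so memory use stays small.
    with lzma.open(path, mode="rt", encoding="utf-8") as handle:
        for line in handle:
            line = line.strip()
            if line:
                yield json.loads(line)


# Build a tiny stand-in file so the sketch runs without a real dump.
# The field names below are made up for illustration only.
sample = [
    {"id": "a1", "name": "Example Artist"},
    {"id": "a2", "name": "Another Artist"},
]
sample_path = os.path.join(tempfile.mkdtemp(), "artist_sample.jsonl.xz")
with lzma.open(sample_path, mode="wt", encoding="utf-8") as handle:
    for document in sample:
        handle.write(json.dumps(document) + "\n")

# Stream the documents back and collect one field from each.
names = [document["name"] for document in iter_documents(sample_path)]
print(names)
```

Reading line by line like this keeps the approach workable for large files, since only one document is held in memory at a time.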

ListenBrainz PostgreSQL Data Dumps

The ListenBrainz project serves as an archive where users can store their music listening history. ListenBrainz provides these users with insights into their listening behaviors by creating detailed statistics reports, as well as providing other music-focused social features.

This dataset can be used to study music consumption patterns and to create new music datasets. ListenBrainz itself uses this data to power a music recommendation engine and to create other derived datasets.

The data dumps are large, containing hundreds of millions of listens. Because of their size, we cannot provide this data in other formats, and we update the full dumps only twice a month. Incremental dumps are also available daily; they provide all the listen data that was added since the last full dump.
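
To make the schedule concrete, the sketch below works out which full dump a given day's incremental data would build on. It assumes full dumps are dated exactly on the 1st and 15th of each month, as listed in the dataset summary; the function name latest_full_dump_date is hypothetical, and actual publication times may differ.

```python
from datetime import date


def latest_full_dump_date(day):
    """Return the most recent 1st or 15th on or before the given date."""
    # Assumption: full dumps are dated on the 1st and 15th of every month.
    if day.day >= 15:
        return day.replace(day=15)
    return day.replace(day=1)


print(latest_full_dump_date(date(2024, 3, 20)))  # 2024-03-15
print(latest_full_dump_date(date(2024, 3, 14)))  # 2024-03-01
```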

Dataset summary
Documentation: ListenBrainz data dumps documentation
Commercial use: Allowed, but financial support strongly urged, even for CC0 data.
Update frequency: Twice a month, on the 1st and 15th, with incremental dumps daily.
Licenses: Creative Commons Zero (CC0)
Format: XZ compressed PostgreSQL table dumps

CritiqueBrainz PostgreSQL Data Dumps

CritiqueBrainz is a repository for Creative Commons-licensed music and book reviews. It connects factual metadata from the MusicBrainz and BookBrainz databases with opinions from critics, listeners, and readers by providing a platform for their reviews.

MetaBrainz provides full dumps of the CritiqueBrainz database as well as JSON dumps containing reviews in an easily consumable format. These datasets are updated every day.

Dataset summary
Update frequency: Daily
Commercial use: Allowed, but financial support strongly urged, even for CC0 data.
Licenses: CC Attribution-NonCommercial-ShareAlike 3.0 and CC Attribution-ShareAlike 3.0
Format: bzip2 compressed PostgreSQL table dumps and JSON dumps