Thursday, October 7, 2021

Pleiades Data for Download

Creators: Sean Gillies
Contributors: Brian Turner, Tom Elliott
Copyright ©; The Contributors. Sharing and remixing permitted under terms of the Creative Commons Attribution 3.0 License (cc-by).
Last modified Oct 07, 2021 08:15 AM
Get complete and regular shapshots of all Pleiades resources, available in multiple formats including JSON, CSV, KML, and RDF.

Regularly updated exports ("dumps") of the published items in the Pleiades dataset may be downloaded in four different formats as described below.

The on-line resources under http://pleiades.stoa.org/places/ remain the canonical versions; the contents of the various export formats varies (see further below). The recently modified page or its corresponding RSS feed are the best ways to track what is changing.

Frequently, we synchronize copies of the JSON, CSV, and RDF/TTL content to a public repository on GitHub, the online software development version control company: https://github.com/isawnyu/pleiades.datasets.

Periodically, we deposit archival copies of the JSON, CSV, and RDF/TTL content in third-party preservation repositories. You can pick up a copy from the following places:

Document Actions

JavaScript Object Notation (JSON)

This is the only comprehensive dump. It contains all attributes of all place, name, location, and connection objects in the database that have been published.

Each morning, a single JSON file for all published places is written to a JSON file at http://atlantides.org/downloads/pleiades/json/. JSON is a widely used, well-known format that is popular for use in web applications and other programming tasks. We keep a week's worth of files, deleting older ones. The file named pleiades-places-latest.json.gz will always get you the most recent version. Note also that those previously published place resources that have been withdrawn and moved to the "errata" section of the site are dumped to a separate json file.

NB: JSON serializations of data for individual places can be had from links on the place pages, such as http://pleiades.stoa.org/places/579885/json for Athens, or by a content-negotiated request for the resource http://pleiades.stoa.org/places/579885#this.

Comma-Separated Values (CSV)

Each morning, tables summarizing the published locations, names, and places are written to compressed CSV files at http://atlantides.org/downloads/pleiades/dumps/. CSV is a widely used, well-known format that can be used by many computer programs. We keep a week's worth of files, deleting older ones. The files named pleiades-*-latest.csv.gz will always get you the most recent versions.

Users of these files will want to be aware of the following documentation:

Keyhole Markup Language (KML)

Suitable for use in Google Earth, QGIS, and other KML-capable software.

CAVEAT:  at present, the KML export uses for its coordinates the "Representative Point" from any place that has published locations. It does not include a representative point for places that have no published locations, but for which a represenative point is calculated by Pleiades on the basis of connected places. If you want comprehensive data, please use the Pleiades JSON export. We hope to address this deficiency in a future release.

Information about all mappable places are read from our database and written to a zipped KML (KMZ) file each morning. We keep a week's worth of files at http://atlantides.org/downloads/pleiades/kml/ and delete older ones. The KML format is well-documented on the Library of Congress Digital Formats website.

NB: KML serializations of data for individual places can be had from links on the place pages, such as http://pleiades.stoa.org/places/579885/kml for Athens, or by a negotiated request for the resource http://pleiades.stoa.org/places/579885#this.

Resource Description Framework (RDF)

The latest data for all places, errata, authors, place types, and time periods is available for download in Turtle (Terse RDF Triple Language) via http://atlantides.org/downloads/pleiades/rdf/pleiades-latest.tar.gz. This is a gzip-compressed, TAR archive. Previous RDF dumps are also available at http://atlantides.org/downloads/pleiades/rdf/. RDF dumps are updated weekly on Sundays.

NB: RDF serializations of data for individual places — in both Turtle and RDF/XML syntax — can be had from links on the place pages, such as http://pleiades.stoa.org/places/579885/turtle for Athens, or by a negotiated request for the resource http://pleiades.stoa.org/places/579885#this. Please see the README in https://github.com/isawnyu/pleiades-rdf for a description of the RDF and the vocabularies and ontologies used

Derived and Enhanced Versions

Pleiades GeoJSON

A GeoJSON version of published *Pleiades* places is generated regularly from the CSV export using code developed by Ryan Baumann (Duke University Libraries). Note that it is an abridged dataset: all published places are represented, but only a subset of the attribute data is included. For comprehensive data, please make use of the full *Pleiades* JSON dataset.

Pleiades Plus

Pleiades Plus (also known as Pleiades+) is an experimental machine alignment between Pleiades place resources and content in the GeoNames gazetteer. It pairs Pleiades Uniform Resource Identifiers (URIs) with GeoNames URIs when a given pair seems likely to identify the same place. This alignment was conceived and prototyped by Leif Isaksen (Pelagios Project) under the auspices of the Google Ancient Places project (you can read the original announcement from 2011 on the GAP Blog). The current version is produced periodically by Ryan Baumann (Duke University Libraries). Code and data are available from https://github.com/ryanfb/pleiades-plus. There is also an essential README file.

 

No comments:

Post a Comment