The next generation of sports data

Based on IPTC’s SportsML and semantic web principles

IPTC Sport Schema is a high-level model that describes the core elements of competitive sport.


Project goals and principles

Our goals were to ensure that the model and vocabulary are:

  • Comprehensive - it should be able to handle schedules, results and statistics for many types of sports, whether team, individual or head-to-head.
  • Easy to use - it should be approachable by people who are not experts in Semantic Web technologies. For example, the JSON-LD versions should be simple enough that any developer who is comfortable with JSON files can parse them.
  • Easy to query - for those who want to use the power of RDF features such as SPARQL, querying data should be as simple as possible.
  • Compatible with schema.org - we want the IPTC Sport Schema to be self-contained, but it should be possible to use it alongside schema.org in the future.
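
As a rough illustration of the "easy to use" goal, the sketch below reads a JSON-LD document with nothing more than Python's standard json module. The document itself is invented for this example: the example.org URLs, the "Event" type and the "name" and "participants" properties are placeholders, not terms taken from the IPTC Sport Schema vocabulary.

```python
import json

# Hypothetical JSON-LD document. The @context URL, the @type value and the
# property names are placeholders, NOT actual IPTC Sport Schema terms.
SAMPLE = """
{
  "@context": "https://example.org/hypothetical-context.jsonld",
  "@id": "https://example.org/event/1",
  "@type": "Event",
  "name": "Example final",
  "participants": [
    {"@id": "https://example.org/team/a", "name": "Team A"},
    {"@id": "https://example.org/team/b", "name": "Team B"}
  ]
}
"""


def participant_names(jsonld_text):
    """Return the participant names using plain JSON handling only."""
    doc = json.loads(jsonld_text)
    # JSON-LD keywords such as @context and @id are ordinary JSON keys,
    # so no RDF library is needed to walk the structure.
    return [p["name"] for p in doc.get("participants", [])]


print(participant_names(SAMPLE))
```

A developer who later wants the full RDF view of the same file can hand it to a JSON-LD processor, but that step is optional for simple consumers.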

Getting started

Exploring the example data using SPARQL

We have created a set of test data based on many of the sample files from the SportsML specification. The SportsML data was converted to RDF triples with our SportsML-to-Sport Schema XSLT stylesheet and uploaded to a server running Fuseki, where it can be queried using SPARQL, the query language for RDF.

Explore IPTC Sport Schema SPARQL Playground

If you haven’t seen SPARQL before, it’s quite similar to SQL. We have created some sample queries that can be used without any SPARQL knowledge.
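
To give a feel for the resemblance to SQL, here is a minimal Python sketch that wraps a generic SPARQL query in an HTTP request following the SPARQL 1.1 Protocol. The query is deliberately vocabulary-free (it lists any ten triples), and the endpoint URL is a placeholder rather than the address of the playground; the request is built but not sent.

```python
from urllib.parse import urlencode
from urllib.request import Request

# Placeholder endpoint: replace with the address of a real SPARQL service.
ENDPOINT = "https://example.org/sparql"

# Roughly "SELECT * FROM triples LIMIT 10" in SQL terms: every RDF statement
# is a (subject, predicate, object) triple, and ?s ?p ?o matches any of them.
QUERY = """
SELECT ?s ?p ?o
WHERE { ?s ?p ?o }
LIMIT 10
"""


def build_request(endpoint, query):
    """Build a SPARQL 1.1 Protocol GET request asking for JSON results."""
    url = endpoint + "?" + urlencode({"query": query})
    return Request(url, headers={"Accept": "application/sparql-results+json"})


request = build_request(ENDPOINT, QUERY)
print(request.full_url)
# To run it against a live endpoint, pass the request to
# urllib.request.urlopen() and parse the response body with json.load().
```

The SELECT, WHERE and LIMIT keywords play much the same role as in SQL; the main difference is that the WHERE clause contains triple patterns instead of column conditions.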

You can also run the example queries from your local machine using the arq tool from the Apache Jena project (or any other SPARQL-compatible tool).
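
As a sketch of what a local run might look like, the Python snippet below assembles an arq command line from a data file and a query file, and only executes it if arq is installed and both files exist. The file names are hypothetical; substitute the data and query files you are actually working with.

```python
import shutil
import subprocess
from pathlib import Path


def build_arq_command(data_file, query_file):
    """Return the argv list for running one query file against one data file."""
    return ["arq", "--data", data_file, "--query", query_file]


# Hypothetical file names, used only for illustration.
DATA_FILE = "sample-data.ttl"
QUERY_FILE = "sample-query.rq"

command = build_arq_command(DATA_FILE, QUERY_FILE)
print(" ".join(command))

# Only attempt the run when arq is on the PATH and both files are present.
if shutil.which("arq") and Path(DATA_FILE).exists() and Path(QUERY_FILE).exists():
    subprocess.run(command, check=True)
```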

See the running example queries page for more detailed information.