Skip to main content

Describe your data collection choices

Every data collection campaign, even the most comprehensive CASTEMO annotation, necessarily makes choices, and is selective.

In collaboration between several users, and also as time goes by, it becomes increasingly tricky to remember what data collection guidelines you used, what you included, what you skipped... while this is, obviously crucial for the interpretation of results based on the data collected. Therefore:

You should always describe your data collection choices.

This can be done in a freely structured way in a document, in a 

It is generally better to keep the description of data collection choices very close to the data themselves.

In CASTEMO knowledge graphs, we generally recommend to append the description of the collected data as part of the metadata of the Territory which holds this CASTEMO data. This can be done through a series of Metaproperties of the Territory. We recommend using the following properties:

  • Start date.
  • End date.

By contrast, the following data should be understood more as