Data Repository

Explore datasets produced and aggregated by TWIN4DEM. Our online platform will serve as a FAIR data point, with data archived in repositories like Open Science Framework, the European Open Science Cloud (EOSC), and GitHub.

Each dataset will include:

  • Full metadata (variables, methodology, data source) using the OSF metadata standard.
  • Open formats such as CSV, XML, JSON, and RDF.
  • Persistent identifiers (DOI) – linking to Open Science Framework and GitHub repositories.

Key Datasets under development

  • Interlinked Database on Executive Aggrandisement: A novel database linking legal acts to individual parliamentary votes, public statements by politicians, and reactions from civil society actors and international organisations.
  • Synthetic Datasets: GDPR-compliant synthetic datasets that combine textual data with non-textual sources (socio-economic, electoral, survey) to create copies of societies and political systems for analysis.
  • Multilingual Textual Corpora: Curated corpora of legal documents, parliamentary debates, etc.

The project datasets will be made discoverable through direct hyperlinks to resources hosted on GitHub and the Open Science Framework.