v1.1 Open corpus · CC0 1.0

Parliamentary text.
Open. Linked. Worldwide.

ParlText is an open corpus of parliamentary speeches and enacted laws, built on the ParlLawSpeech data standard. Version 1.1 covers four Central European legislatures — more are planned.

4
Legislatures
1990–2024
Coverage
Speeches
~4.1 GB CSV
Laws
~320 MB tab

ParlText 1.1: Four Central European legislatures.

Speeches and laws are linked by date, agenda, and speaker so a debate and the resulting law can be read together — country by country, or pooled across all four. More legislatures are planned for future releases.

The corpus

Speeches and laws — linked.

ParlText combines parliamentary floor speeches and enacted laws into a single linked corpus. Records carry dates, speakers, and agenda IDs that follow the ParlLawSpeech standard, so a debate and the resulting law can be read together — and statistics are comparable across jurisdictions.

The v1.1 release spans four Central European parliaments from 1990 onward; the framework is designed to onboard any legislature. Bill data is not yet part of the published Dataverse release.

Open access

CC0 1.0 on Harvard Dataverse.

All released versions live on the ParlText collection at Harvard Dataverse under the Creative Commons CC0 1.0 public-domain dedication. Speech files ship as .CSV and .RDS; law files ship as .tab and .RDS.

Browse the data →