Cohort Data

The Kids’ Environment and Health Cohort aims to set up a national database containing de-identified data from schools, hospitals and community pharmacies, on health and education histories for all children born in England from 2006 onwards – around 11 million children. This data will be linked to information about their mothers’ health during pregnancy as well as data on local environments in and around children’s homes and schools.


What data will be used to develop the Kids’ Environment and Health Cohort database?

The project involves linking the following datasets:

  • ONS birth and death registration data
  • Census 2011 and 2021 data: for children born within two years of each Census
  • Hospital Episode Statistics: contains data on hospital contacts
  • Maternity Services Data: holds data on maternal health during pregnancy
  • Mental Health Dataset: holds information on referrals to mental health services
  • Community Dispensing Data: information on dispensed medicines, including for asthma
  • National Pupil Database: holds data on all children in state schools, including special educational needs provision and exam results
  • Personal Demographics Service (NHS address records) and Get Information About Schools data (school addresses). These will be used by the ONS to link data on the local environment to children’s data.

A number of environmental datasets about small areas across England, on air pollution, energy efficiency of buildings, and proximity to major roads, will be linked to the de-identified health and education data.

The data will be linked subject to approvals from the Confidentiality Advisory Group, the ONS, NHS Digital and the Department for Education.


Potential of the newly linked data

The newly linked data resource will open opportunities for research that can inform government departments and local councils, as well as the public at large, about how changing local environments impact children’s health and education. It will also enable new insights into how well housing, environmental and planning policies are working to improve children’s lives.

In order to demonstrate how the Kids’ Environment and Health Cohort can be used, the team will carry out a research project to examine:

  1. links between local greenspace (coverage and access) and mental health in young people.
  2. links between the availability and quality of local childcare provision and primary education attainment.


Key questions these newly linked datasets can address include:

  1. Do children with better access to public parks or other greenspaces have better outcomes in school?
  2. Are children with asthma who grow up in highly insulated but less ventilated homes at risk of developing worse asthma symptoms?
  3. Is exposure to extreme heat or heatwaves during pregnancy linked to babies being born prematurely?
  4. Is going to school near gambling outlets linked to worse mental health in young people?
  5. For children with complex chronic conditions such as autism, epilepsy or cystic fibrosis, does living or going to school near traffic-heavy roads increase the risk of being admitted to hospital?


Data security and access

The Kids’ Environment and Health Cohort will be stored in the Office for National Statistics Secure Research Service (ONS SRS), a national Trusted Research Environment. 

The requirements that researchers must meet before gaining access to the Kids’ Environment and Health Cohort data for their research are being discussed with data providers. All researchers accessing the Kids’ Environment and Health Cohort in the ONS SRS will need to be Accredited Researchers and undertake further training depending on the constituent datasets requested. Researchers will also need to obtain ethical approval and sign agreements with UCL before accessing the data.
