1. Home
  2. Provider Support Manual
  3. Collecting and Submitting Data
Provider support manual:

Collecting and Submitting Data

Data Management Principles

Data management principles for ILR completion

You must adhere to the following five Data Management Principles when completing the ILR.

Principle 1: The ILR must accurately describe the provision delivered to each learner

The data you record on the ILR must accurately reflect the journey for the learner and what has happened. Inaccurate information must never be entered even where it is perceived that this would result in a more equitable claim for funding or accurate record of performance.

If no learning is delivered for a learner, then no learning should be recorded on the ILR.
For example, if a learner withdraws without attending the first class, then this learner is not included on the ILR.

You must not record this on the ILR with a Completion status of ‘withdrawn’.

Principle 2: The ILR must accurately and comprehensively reflect what is recorded in the learner file or learning agreement

The learner file (or learning agreement) records the goals that the learner and provider have agreed. It is against these goals that provider performance, in terms of achievement rate, is measured.

It is recognised that the learning aim may be agreed during the initial period of learning for a long qualification. However, a learning aim must not be changed once set. It is reasonable to expect that the goal should be agreed as soon as possible.

Consequently, providers must agree and record the learning aim within the funding qualifying period as defined in the relevant ESFA funding documentation.

Where a provider and learner agree to a change of aim after delivery of the aim has commenced and the funding qualifying period has passed, then this change must be recorded as a transfer in the ILR.

Principle 3: For any particular return, a provider must meet the timeliness specification

Where there is a collection reference date on the ILR data collection timetable (Appendix A), you must accurately describe in the ILR all provision delivered up to and including the collection reference date. The provider may include data for provision delivered after the collection reference date.

For ILR returns that do not have a reference date, you must return new starters, leavers and changes in a timely manner.

Where data describes provision to be delivered in the future, that is beyond the reference date or beyond that required to meet the timeliness standard, no one should assume this data is complete or accurate.

Principle 4: Basic pieces of information about a learner and their learning must remain constant once entered in the ILR except where the information has been entered in error.

The following fields in particular should not be changed without keeping a record of the reason for the change:

  • Planned learning hours

  • Planned employability, enrichment and pastoral hours

  • Postcode prior to enrolment

  • Learning aim reference

  • Funding model

  • Learning start date

  • Planned hours

The Learning planned end date field must not be changed once set and this is stated specifically in the ILR specification.

Where inaccurate data is sent, Principle 1 takes precedence. It is more important to correct inaccurate data than to not change fields.

Principle 5

Providers should aim to implement data management best practice when processing learner data within their systems to deliver timely and accurate data in their ILR.

How should I collect ILR data?

How should I collect ILR data?

You can collect the data required to make an ILR return in whatever way you wish to and in the best way that supports your natural business processes. For example, information about a learner may be gathered on a paper enrolment form or through an online enrolment process. Much of the information about the learning aims and programmes being undertaken may be held within a Management Information System (MIS) and can be exported directly from this.

You must ensure that you collect data in line with the ILR data management principles, particularly Principle 2: the ILR must accurately and comprehensively reflect the evidence recorded in the learner file or learning agreement.

Data protection requirements

You should make sure that all learners have seen the Privacy Notice, which informs them about how their data will be used. The Privacy Notice can be found here: https://guidance.submit-learner-data.service.gov.uk/ilrprivacynotice

You should ask learners if they agree to be contacted for marketing, survey or research purposes and record this information in the Learner contact preference fields in the ILR.

You are required to ensure that the requirements of the General Data Protection Regulation (GDPR) are maintained at all times.

Learner Files

This was previously called ‘Learning Agreements’. All learners must have a learner file.

The ESFA funding rules and Advanced learner loans funding rules provide detailed information about the requirements for the learner file for ESFA and Advanced Learner Loans funded further education.

The ESFA 16 to 19 education funding regulations document provides funding compliance guidance in relation to learner recruitment, existence, retention and achievement.

Paper forms

All ILR data is required to be reported electronically to the ESFA, however you may use a paper form at the point of contact with the learner to capture data about the learner and this may form part of the learner file or learning agreement.

You are encouraged to use your own processes to gather ILR data which best meet the needs of your organisation. This may not involve the use of paper forms at all.

How do I send data to the ESFA?

How do I send data?

ILR data must be sent to the ESFA by uploading a file in XML (extensible mark-up language) format through the ‘Submit Learner Data’ service.

You file names must be unique (not used in the service before) and follow the naming convention.

It is best practice to send data to the ESFA regularly and not wait until a return date to upload a file. You should upload an ILR file to Submit Learner Data early in a return period to allow enough time to resolve any validation errors and warnings.

You should produce this XML file from your MI system and upload it to the ESFA via Submit Learner Data.

If you do not have an MI system capable of generating an XML file, the ESFA provide an ILR Learner Entry Tool which can be used to create an ILR file for upload to Submit learner Data.

If you experience problems uploading an ILR file, you should contact the ESFA service desk as soon as possible.

File compression

We recommend that ILR files are uploaded as compressed files; compressed files, sometimes referred to as archives or .zip files, contain a version of the ILR data that is significantly smaller than the uncompressed XML file. Upload times for compressed files are shorter, which benefits all providers and contributes to maximum reliability of the service.

Each compressed file should contain one ILR XML file only. The file must not be encrypted, or password protected. The name of the .zip file should be the same as the name of the XML file but with the extension .zip instead of .xml.

ILR file transmission

Your file transmissions must contain all Learner records and Learning Delivery records for all learners across all funding models for the year to date, for that ILR return.

Each file you submit overwrites all previously submitted records from your organisation. This means that you cannot split your data into separate files and transmit each file separately. You cannot send records for learners funded from different funding models in separate files as these will overwrite one another.

You must submit a single file containing all of your learner and learning records for all funding models, for the year to date.

Take care to ensure that data is not overwritten in error: reports such as the ‘Rule Violation Report’ are useful for checking that the expected number of learners and learning delivery records are in your file.

Once you have submitted a file it cannot be deleted, if the file contains incorrect data this can only be corrected by submitting another ILR file to overwrite the incorrect one.

The last file you submit prior to the close of an ILR return will be the one loaded into the national database for that particular return.

You may need to use the amalgamation tool in order to create a single ILR file containing all learner records for all funding models.

The impact of incomplete information

If you sent ILR data that contains incomplete, incorrect or omitted data, this may result in the learner record not being accepted onto the national ILR database. It is essential that ILR data is completed and returned promptly, fully and accurately.

Any data for individual ESFA funded learners that is not accepted by the national ILR database will mean that the Provider Funding Report (PFR) will not show the learner’s details and may result in non-payment or clawback of funding in that period for providers paid on actuals.

Funding calculations and provider payments for all ESFA funded learning aims are based directly on the data provided in the ILR. Inaccurate or late information may result in payments not being made.

You must ensure that all documentation relating to the enrolment of the learner and the record of learning activity is completed accurately and conforms to the eligibility rules of the appropriate type of learning. Funding compliance actions may be taken by the ESFA if your data and documentation do not support the eligibility of a funding claim.

The Funding Information System (FIS)

The FIS is a standalone piece of software produced by the ESFA.

The software contains the ILR Validation rules and allows you to check your ILR data before submitting it through Submit Learner Data. The FIS also contains most of the funding calculations for the ESFA and a number of funding reports which may be run and exported based on your data.

The funding reports generated by the FIS are indicative reports. Please note that, in some cases, the results of the funding calculation for Funding model 36 may be incorrect because not all data required to accurately calculate funding can be made available in the FIS. You should refer to the online service for your actual earnings and payments reports.

Some validation rule checks are not included in the FIS and only take place when data is submitted to us. These are rules that check external tables, such as the Unique Learner Number (ULN), Employer identifier and postcode validation.

Documentation about the FIS including installation/uninstallation instructions and user guidance is available through the Submit learner data service on GOV.UK.

Combining ILR files: amalgamation

If you have multiple MI systems across your organisation you will need to combine the ILR data from these separate systems into a single ILR file before you submit it to the online service. For example, you may hold apprenticeship learners and learning aims in a separate system to Adult Skills Fund funded learners and learning aims or have multiple internal departments that hold data separately. If this is the case, you will need to amalgamate these separate files into a single ILR file for submission.

The ILR File merge tool (available from Submit learner data ) can combine multiple files to create a single XML file for all learners. You can also use your own software to create a single file.

The ILR File merge tool joins together records from multiple ILR files for learner records that have the same Learner reference number. For the records to be successfully combined into a single learner record, all the data in the Learner fields on all of the records must be the same. Any conflicting learner data will result in an amalgamation error which will need to be corrected. For example, if the Date of birth has been incorrectly recorded on one record and does not match the Date of birth on the other record then an amalgamation error will be generated, and the records will not be combined.

Where data for the same learner is held in separate systems but with different Learner reference numbers, you should combine these into a single learner record. If you merge two or more records for the same learner, you may use any one of the learner’s previous Learner reference numbers.

Any changes to a learner’s Learner reference number from one ILR year to the next must be recorded using the Learner reference number in previous year field. This will enable data matching over multiple ILR years for a learner to be carried out for purposes such as achievement rate calculations. See the ILR Specification for further details and collection requirements for this field.

Data validation

ILR data is subject to validation checks to help ensure that the data received by the ESFA is accurate and complete. ILR files are validated at the point of transmission to the online service or through the FIS against both the XML schema definitions and validation rules.

If any data fails the validation checks, then the learner record and all associated records for that learner are rejected. Rejected records are not loaded into the national ILR database and do not generate funding; these records are reported on the rule violation report.

The FIS does not run all validation checks: rules that rely on external tables such as postcode or Unique Learner Number tables are not included in the FIS. These checks only take place when you submit your data.

The ILR validation rules document includes details about the error condition for each rule and the ILR data it checks. The validation rule document includes a column named “Online Only”, which identifies rules that are not included in FIS and are only run when data is submitted to the online service.

XML schema validation

XML schema validation ensures that your file is in the correct format and that the data complies with the definitions in the ILR specification. There are two stages to the XML schema validation

Stage 1: The ILR file is checked for the following:

  • That the XML is well-formed. Well-formed means that the file adheres to XML’s strict syntactical rules for open and close tags and the nesting of data elements

  • Elements are presented in the expected sequence, as defined in the schema

  • An element conforms to its data type. Examples of this would include where a numeric item contains alpha characters, or where a date contains invalid values

If any part of the file fails any of these checks, then the whole file will be rejected and not processed further, the errors will be reported on the rule violation report.

Stage 2: The individual elements (fields) within the file are checked for the following:

  • All mandatory fields have been returned

  • Field lengths are adhered to

  • Data patterns are adhered to

If a field fails any of these checks, then the learner record, and associated records for that learner only, will be rejected, reported on the rule violation report and not processed further.

The published XML schema contains all the definitions and requirements for both stages of schema validation.

Schema errors are reported on the rule violation report, however, the error messages generated are generally briefer than those for other validation errors.

Only learner records, and associated records for that learner, that have passed schema validation will be passed through to the validation rules.

Validation rules

The final data checks are made by applying the validation rules, which include business validation and funding eligibility checks.

Validation errors  are produced where data does not make sense or cannot be correct. You will need to correct any records that produce validation errors.

If any part of a learner or associated learning delivery record triggers a validation error, then the learner and all of their learning delivery records will be rejected and not processed further. Only valid learner records are loaded into the national ILR database.

Validation errors are reported on the validation error reports which are produced on the online service and in the FIS. You should check these reports after uploading an ILR file. Validation errors that start “FD_” indicate data that is breaking schema restrictions. You should check this data against the definition of the field in the ILR specification.

All learning aims returned for a learner must be valid, regardless of the type of funding. For example, if funded learning aims are included alongside non-funded aims in a return at the start of the year, then all of these aims must be valid in order for the learner records to be accepted.

Validation warnings  are produced where the data is unusual or where the data is allowed under certain circumstances. Records that produce warnings are loaded into the national database; however you should check these records to ensure the data is correct.

Postcode validation

It is important that postcodes are correctly formatted, this includes the space between the two parts of the postcode. Postcodes with two spaces will fail validation when the data is uploaded to the online service.

Most postcode fields in the ILR are mandatory fields and cannot be left blank. For mandatory postcode fields, if the postcode is unknown or does not apply then a value of ZZ99 9ZZ should be recorded.

For the Learning start date postcode field and if the learner's postcode is unknown or not yet updated on the ONS postcode table (i.e. a new build), providers must return a postcode of ZZ99 9ZZ.

Monitoring the funding rules

As part of financial assurance processes, we review how the funding system and funding rules are operating to identify possible errors in funding claimed and areas requiring further investigation. The outcomes of this also allow us to confirm that policy specifications are working and achieving the desired outcome(s). As part of this funding monitoring process, we also aim to improve the overall quality of the data being reported to the ESFA.

The funding monitoring plan lists the current areas that are being monitored during the course of the funding year. You can access regular monitoring reports through the online service which will highlight any potential issues to you.

If we identify records in a monitoring area, you must review this data to investigate and determine whether you need to make any data corrections in your next ILR submission. Where there is an explicit statement of the action you must take, then you must comply with this guidance: actions to take can be found within the funding rules, ILR guidance or the monitoring reports user guide.

If you have data errors at the end of the funding year that have not been corrected by the R14 ILR hard close, the following actions may be taken:

  • Funding recovered where funding has been claimed in error,

  • Assurance visits conducted for specific issues before the end-of-year reconciliation statements are issued, and/or

  • Issues used to inform the monitoring process for selecting providers for audit in future funding years.

Further details and guidance documents about funding monitoring can be found on the ESFA financial assurance webpage.

Provider data self-assessment toolkit

The provider data self-assessment toolkit (PDSAT) produces a suite of reports to test the integrity of ILR data in order to assist you with identifying potential anomalies or errors in your data.

Within the suite, PDSAT includes reports that enable providers to identify errors relating to monitoring of the funding rules. The ESFA uses a similar set of reports to routinely monitor the ILR data, to identify potential errors, and for assurance that providers are complying with the funding rules.

We encourage you to use the PDSAT frequently throughout the year as part of your routine data cleansing, for identifying errors and preparing for financial assurance visits. PDSAT is designed to give you a toolkit to review your ILR data and to assist auditors in the audit of providers’ data.

The current PDSAT download and guidance can be found in the 'Help and support' section of the ILR home page.

Earnings adjustment statement (EAS)

The Earnings Adjustment Statement (EAS) is a CSV file, submitted online, that providers can use to claim ESFA funding that is not reported on the ILR.

The ESFA funding rules give further information on categories within the EAS and their evidence requirements.

The EAS is available for collection periods corresponding to the 12 months of the funding year. Funding for period 1 is added to funding from the ILR from August on the Funding Summary Report, period 2 funding is added to funding from the ILR for September, and so on. The total amount of funding claimed in each of the collection periods’ EAS will be summarised to calculate the total for the funding year.

There are different expectations on how regularly EAS returns are submitted depending on whether the return is for a contract-funded provider or a grant-funded provider. See the 'Help and support' section of the ILR home page for EAS documentation.

When do I send data to the ESFA?

ILR data must be returned according to the data collection timetable (Appendix A) of the ILR Specification.

The ILR timetable sets out:

  • The return date by which you must send complete data for the purposes described in the timetable

  • The data uses and purpose of the return

  • Which types of provider need to send data for each particular return

The return date in the timetable is the hard close date for including data in the national database for that return and represents the last opportunity to send data for each return.

You must ensure that the data held on the national database is complete and fit for purpose by these dates.

You must ensure that the data held by the ESFA is up to date for the purposes described in the timetable by 6pm on the fourth working day of the month. Any data sent after 6pm on the return date will not be processed for the purposes of that particular return.

Where the timetable includes a ‘reference date’ for any particular return, your ILR data must accurately describe all provision delivered up to and including this date. You may include data for provision delivered after the reference date. For returns that do not have a reference date, you must return new starters, leavers and changes in a timely way.

ILR timeliness requirements:

  • New starts must be reported within 2 reporting months of their start date

  • Achievements must be reported within 3 reporting months of the point of achievement

  • Apprenticeship withdrawals must be reported within 3 reporting months of their leave date.

The type of provider you are determines what data returns are required. In broad terms, FE colleges are required to send data on a quarterly basis and training organisations on a monthly basis, however there are exceptions to this.

Data is required monthly from all providers for the following. You must ensure that the data is complete and error-free for:

  • 16-18 apprenticeships (frameworks and standards)

  • 19+ apprenticeship standards

  • Apprenticeships started on or after 1 May 2017 (Funding model 36)

  • Skills Bootcamps (Funding model 37)

FE colleges and other grant funded providers must decide for themselves what data is sent in addition to the data essential to meet that required for any particular return date. The data required monthly (detailed above) must be complete and error free while data for other provision that is sent in the same file can be incomplete and contain errors.