Curtailment (SPEN_009) Data Quality Checks

This data table provides the detailed data quality assessment scores for the Curtailment dataset. The quality assessment was carried out on the 31st of March.

At SPEN, we are dedicated to sharing high-quality data with our stakeholders and being transparent about its quality. This is why we openly share the results of our data quality assessments. We collaborate closely with Data Owners to address any identified issues and enhance our overall data quality. To demonstrate our progress, we assess our data quality at least bi-annually. For datasets that are refreshed more frequently than this, please note that the quality assessment may be based on an earlier version of the dataset. To learn more about how we assess data quality, visit Data Quality - SP Energy Networks.

We welcome feedback and questions from our stakeholders regarding this process. Our Open Data Team is available to answer any enquiries or receive feedback on the assessments. You can contact them via our Open Data mailbox at opendata@spenergynetworks.co.uk.

The first phase of our data quality assessment measures the quality of our datasets across three dimensions: validity, completeness and uniqueness. Please refer to the data table schema for the definitions of these dimensions. We are now expanding our quality assessments to cover additional dimensions and will update the data tables with the results when available.

Dataset schema

Name

Title of the data table.

Name (identifier)	name
Type	text
Sample

Field

Name of column that data check has been applied to.

Name (identifier)	field
Type	text
Sample

Description

Details of data check applied.

Name (identifier)	description
Type	text
Sample

Score

Percentage of rows that adhere to data quality check.

Name (identifier)	score
Type	decimal
Sample

Failed Rows

Number of rows that did not adhere to the data quality check.

Name (identifier)	failed_rows
Type	decimal
Sample
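
The Score and Failed Rows fields describe the same check from two angles. A minimal sketch of how they could relate, assuming the score is expressed on a 0-100 scale and that the number of rows checked is known. The table does not publish a row total, so `total_rows` and the function name are hypothetical:

```python
def score_from_failed_rows(total_rows: int, failed_rows: int) -> float:
    """Percentage of rows that adhere to a check (assumed 0-100 scale)."""
    if total_rows <= 0:
        raise ValueError("total_rows must be positive")
    if not 0 <= failed_rows <= total_rows:
        raise ValueError("failed_rows must be between 0 and total_rows")
    # Rows that passed = rows checked minus rows that failed
    return 100.0 * (total_rows - failed_rows) / total_rows


# Hypothetical numbers, not taken from the published table
example_score = score_from_failed_rows(total_rows=200, failed_rows=5)
print(example_score)  # 97.5
```

If the portal reports scores as a fraction between 0 and 1 instead, the factor of 100 would need to be dropped.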

Dimension

VALIDITY measures whether the values in a dataset are within the correct range or format. This dimension ensures that the data adheres to predefined criteria set by the data owner, such as acceptable value ranges, formats, and types.

COMPLETENESS checks whether the cells in a dataset are filled or empty. The score is based on a simple 'Yes/No' - if the cell is filled, it counts as complete. This check does not consider if the value in the cell is correct/valid.

UNIQUENESS measures how many values in a dataset are unique. Any duplicate values will lower this score. This measure is important for data that must be unique to be correct, such as Customer ID or Project Reference ID.

Name (identifier)	dimension
Type	text
Sample
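
The three dimension definitions above can be sketched as simple column-level calculations. This is an illustration under stated assumptions, not SPEN's actual implementation: the function names are hypothetical, scores are assumed to be on a 0-100 scale, uniqueness is assumed to be the share of distinct values, and `is_valid` stands in for whatever rule the data owner has defined.

```python
from typing import Callable, Hashable, Optional, Sequence


def _percentage(passed: int, total: int) -> float:
    """Share of passing rows on an assumed 0-100 scale."""
    if total == 0:
        raise ValueError("cannot score an empty column")
    return 100.0 * passed / total


def completeness(values: Sequence[Optional[object]]) -> float:
    """A cell counts as complete if it is filled; its content is not judged."""
    filled = sum(1 for v in values if v is not None and str(v).strip() != "")
    return _percentage(filled, len(values))


def uniqueness(values: Sequence[Hashable]) -> float:
    """Assumption: distinct values / total values, so each repeat lowers the score."""
    return _percentage(len(set(values)), len(values))


def validity(values: Sequence[object], is_valid: Callable[[object], bool]) -> float:
    """is_valid is a placeholder for the data owner's range, format or type rule."""
    return _percentage(sum(1 for v in values if is_valid(v)), len(values))


# Hypothetical column values, for illustration only
project_ids = ["P-001", "P-002", "P-002", "P-003"]
print(completeness(["a", "", None, "b"]))                        # 50.0
print(uniqueness(project_ids))                                   # 75.0
print(validity(project_ids, lambda v: str(v).startswith("P-")))  # 100.0
```

Other readings of uniqueness are possible, for example counting every row involved in a duplicate as failed; the choice here is only one option.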

JSON Schema

The following JSON object is a standardized description of this dataset's schema.

{
  "title": "spen_data_quality_curtailment",
  "type": "object",
  "oneOf": [
    {
      "$ref": "#/definitions/spen_data_quality_curtailment"
    }
  ],
  "definitions": {
    "spen_data_quality_curtailment": {
      "properties": {
        "records": {
          "type": "array",
          "items": {
            "$ref": "#/definitions/spen_data_quality_curtailment_records"
          }
        }
      }
    },
    "spen_data_quality_curtailment_records": {
      "properties": {
        "fields": {
          "type": "object",
          "properties": {
            "name": {
              "type": "string",
              "title": "Name",
              "description": "Title of the data table."
            },
            "field": {
              "type": "string",
              "title": "Field",
              "description": "Name of column that data check has been applied to."
            },
            "description": {
              "type": "string",
              "title": "Description",
              "description": "Details of data check applied."
            },
            "score": {
              "type": "number",
              "title": "Score",
              "description": "Percentage of rows that adhere to data quality check."
            },
            "failed_rows": {
              "type": "number",
              "title": "Failed Rows",
              "description": "Number of rows that did not adhere to the data quality check."
            },
            "dimension": {
              "type": "string",
              "title": "Dimension",
              "description": "VALIDITY measures whether the values in a dataset are within the correct range or format. This dimension ensures that the data adheres to predefined criteria set by the data owner, such as acceptable value ranges, formats, and types. COMPLETENESS checks whether the cells in a dataset are filled or empty. The score is based on a simple 'Yes/No' - if the cell is filled, it counts as complete. This check does not consider if the value in the cell is correct/valid. UNIQUENESS measures how many values in a dataset are unique. Any duplicate values will lower this score. This measure is important for data that must be unique to be correct, such as Customer ID or Project Reference ID."
            }
          }
        }
      }
    }
  }
}
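
As a rough illustration of the record shape this schema describes (a `records` array whose items each carry a `fields` object), the sketch below parses a small payload and lists the checks that reported failed rows. The payload values, column names and the helper `checks_with_failures` are invented placeholders, not results from the published table.

```python
import json

# Placeholder payload shaped like the schema above; every value is invented
# for illustration and does not come from the published table.
SAMPLE_JSON = """
{
  "records": [
    {"fields": {"name": "Curtailment", "field": "example_column_a",
                "description": "Example completeness check", "score": 100.0,
                "failed_rows": 0, "dimension": "COMPLETENESS"}},
    {"fields": {"name": "Curtailment", "field": "example_column_b",
                "description": "Example validity check", "score": 97.5,
                "failed_rows": 5, "dimension": "VALIDITY"}}
  ]
}
"""


def checks_with_failures(payload: dict) -> list:
    """Return the `fields` object of every record with at least one failed row."""
    return [
        record["fields"]
        for record in payload.get("records", [])
        if record.get("fields", {}).get("failed_rows", 0) > 0
    ]


payload = json.loads(SAMPLE_JSON)
flagged = checks_with_failures(payload)
for check in flagged:
    print(check["field"], check["dimension"], check["failed_rows"])
```

Filtering on `failed_rows` rather than `score` avoids having to assume whether scores are reported on a 0-100 or 0-1 scale.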
