69 fields · across 22 categories · 22 sourced from CreditSafe.
Classification
| Field name | Description | Data type | Source | Updated |
|---|---|---|---|---|
CICs | Custom Industrial Classifications (CICs) the company has been classified into. API includes the industry verticals within the same field. On the download, verticals are a separate column. These are custom RTICs. · Related: The Data City guide | array[companynumberrtic] | The Data City | Ad hoc |
ConsolidatedAccountsCompanyName | A list of company names within the same group structure that have consolidated accounts. This is a strong signal of the companies that are more likely to be the parent company. This is because it is common for the parent company to file consolidated accounts that include all the subsidiaries in the group structure. · CreditSafe-sourced | array[string] | The Data City | Monthly |
ConsolidatedAccountsCompanyNumber | A list of company numbers within the same group structure that have consolidated accounts. This is a strong signal of the companies that are more likely to be the parent company. This is because it is common for the parent company to file consolidated accounts that include all the subsidiaries in the group structure. · CreditSafe-sourced | array[string] | The Data City | Monthly |
DistinctBrandsInGroup | A list of distinct brands that can be found across the company’s group structure. | array[distinctbrandinfo] | The Data City | Monthly |
ESG_RTIC | A boolean value indicating whether the company is in an ESG RTIC. · Download name: ESGRTIC | boolean | The Data City | Monthly |
IS8Codes | Industrial Strategy Sector codes associated with the company, derived from RTICs and RSICs. · Nested object — see IndustrialStrategySector | array[industrialstrategysector] | The Data City | Ad hoc |
IsLikelyAgency | A boolean value indicating whether the company is likely to be an agency. This is based on assessment of the company’s financials. | boolean | The Data City | Monthly |
LinkedCompanyNumbers | A list of company numbers that are linked to the company, even if they are not connected through group structure. This is based on a combination of signals including shared officers, shared addresses, shared shareholders, and other signals. · CreditSafe-sourced | array[string] | The Data City | Monthly |
RSICs | Real-Time SIC (RSIC) codes with associated confidence score · Related: The Data City guide · Nested object — see RSIC | array[rsic] | The Data City | Monthly |
RTICs | RTICs the company has been classified into. API includes the industry verticals within the same field. On the download, verticals are a separate column. · Related: The Data City guide | array[companynumberrtic] | The Data City | Ad hoc |
SICHLUs | Companies House data. SIC Sections. Nature of business: Standard Industrial Classification (SIC) codes. · Related: Companies House guide | array[string] | Companies House | Monthly |
SICs | Companies House data. Nature of business: Standard Industrial Classification (SIC) codes. · Related: Companies House guide | array[string] | Companies House | Monthly |
UltimateLinkedCompanyNumber | The ultimate linked company number in the group structure. This is the company number that was first incorporated within the collection of linked companies. · CreditSafe-sourced | string | The Data City | Monthly |
UltimateUKParentCompanyNames | A list of company names belonging to companies in the group that are the highest level UK registered companies. · CreditSafe-sourced | array[string] | The Data City | Monthly |
UltimateUKParentCompanyNumbers | A list of company numbers belonging to companies in the group that are the highest level UK registered companies. · CreditSafe-sourced | array[string] | The Data City | Monthly |
Company
| Field name | Description | Data type | Source | Updated |
|---|---|---|---|---|
CompanyCategory | Companies House data. The type of company, e.g. Publically Listed (PLC) or Limited (LTD). · Download name: Companycategory | string | Companies House | Monthly |
CompanyName | Companies House data. Companies have the ability to change their company name through submission to Companies House. · Download name: Companyname | string | Companies House | Monthly |
CompanyNumber | Companies House data. Unique key, doesn’t change. Most important part of our data. · Related: Companies House guide · Download name: Companynumber | string | Companies House | Monthly |
CompanyStatus | The company status as per Companies House. | string | The Data City | Monthly |
CountryOfOrigin | Companies House data. Not to be confused with group structure (ParentCompanyNation, UltimateParentCompanyNation). · Download name: Countryoforigin | string | Companies House | Monthly |
IncorporationDate | Companies House data. · Download name: IncorporationDate/SortableIncorporationDate | string | Companies House | Monthly |
RegisteredAddress | Companies House data. The companies registered address as registered on Companies House. This is not to be confused with their head office. Not sourced from CreditSafe. · Related: Companies House guide · Download name: Registeredaddress | string | Companies House | Monthly |
RegisteredPostcode | Companies House data. Not to be confused with head office (we do not know this at this stage). Not sourced from CreditSafe. · Related: Companies House guide · Download name: Registeredpostcode | string | Companies House | Monthly |
Social Media
| Field name | Description | Data type | Source | Updated |
|---|---|---|---|---|
Bluesky | Takes the most common anchor link which contains the social media domain. We prioritise social media links that are found on the homepage. | string | Web Scraping | Monthly/Quarterly |
Facebook | Takes the most common anchor link which contains the social media domain. We prioritise social media links that are found on the homepage. | string | Web Scraping | Monthly/Quarterly |
Instagram | Takes the most common anchor link which contains the social media domain. We prioritise social media links that are found on the homepage. | string | Web Scraping | Monthly/Quarterly |
Linkedin | Takes the most common anchor link which contains the social media domain. We prioritise social media links that are found on the homepage. | string | Web Scraping | Monthly/Quarterly |
Twitter | Takes the most common anchor link which contains the social media domain. We prioritise social media links that are found on the homepage. | string | Web Scraping | Monthly/Quarterly |
Youtube | Takes the most common anchor link which contains the social media domain. We prioritise social media links that are found on the homepage. | string | Web Scraping | Monthly/Quarterly |
Financials
| Field name | Description | Data type | Source | Updated |
|---|---|---|---|---|
BestEstimateEBITDA | Where OPERATING_PROFIT, DEPRECIATION_OF_TANGIBLES and AMORTISATION_OF_INTANGIBLES are available, these three values are summed. · CreditSafe-sourced · Download name: EBITDA | number | The Data City | Monthly |
BestEstimateTurnover | Available from Companies House for large companies, if absent then The Data City provide an estimate. The BestEstimate will report actual (reported) where available or it will be an estimate. BestEstimateTurnover currently refers to the latest complete year. · CreditSafe-sourced · Related: The Data City guide · Download name: BestEstimateCurrentTurnover | integer | The Data City | Monthly |
CompanyFinancials_CreditSafe | Annual financial reporting from filings processed by Creditsafe. Creditsafe use a semi-automated process to convert scanned financial PDF documents into numerical/digital values. This process may lead to errors, e.g. reporting profit as turnover. · CreditSafe-sourced · Related: CreditSafe guide · Download name: CompanyFinancialsCreditSafe · Nested object — see CleanCreditSafeCompany | array[cleancreditsafecompany] | CreditSafe | Monthly |
EstimatedGVA | We have produced an estimated GVA measure at the company level. We have employee and SIC information for each company. This makes it possible to estimate a GVA value per company by multiplying a company’s number of employees by the standard GVA employee contribution associated with that same company’s SIC. · CreditSafe-sourced · Related: ONS; BRES guide · Download name: BestEstimate_CurrentGVA | number | ONS; BRES | Monthly/Quarterly |
EstimatedUKGVA | We have produced an estimated GVA measure at the company level. We have employee and SIC information for each company. This makes it possible to estimate a GVA value per company by multiplying a company’s number of UK employees by the standard GVA employee contribution associated with that same company’s SIC. · CreditSafe-sourced · Related: ONS; BRES guide · Download name: BestEstimate_CurrentUKGVA | number | ONS; BRES | Monthly/Quarterly |
Funding
| Field name | Description | Data type | Source | Updated |
|---|---|---|---|---|
_360GivingFunding | Data provided by a charity that helps organisations to publish open, standardised grants data, and supports people to use it to improve charitable giving. Source: https://grantnav.threesixtygiving.org/ · Related: 360Giving guide · Download name: 360GivingFunding · Nested object — see BasicGrant360 | array[basicgrant360] | 360Giving | Monthly |
DealroomFunding | The Data City has matched a considerable amount of Dealroom’s companies (with their help) to a CompanyNumber with >95% confidence. For each company, round data is available for all users. TotalFundingRaised_GBP_Million is the sum of all rounds, except Debt and Acquisition rounds. This is the total across Dealroom’s tracking period and is not the total in one particular year. · Related: Dealroom guide · Nested object — see DealroomPerRoundFunding | array[dealroomperroundfunding] | Dealroom | Monthly |
InnovateUKFunding | Information about the projects funded by Innovate UK from 2004. Source: https://www.ukri.org/publications/innovate-uk-funded-projects-since-2004/ · Related: InnovateUK guide · Nested object — see InnovateUKFundingClean | array[innovateukfundingclean] | InnovateUK | Monthly |
Gender
| Field name | Description | Data type | Source | Updated |
|---|---|---|---|---|
GenderFoundedCategory | The company’s single gender-leadership category based on its active founders. One of: All women led, Majority women led, Mixed led, Majority men led, All men led, Uncertain. Mutually exclusive — every company falls in exactly one. · CreditSafe-sourced · Related: The Data City guide · Download name: gender_founded_category | string | The Data City | Monthly |
GenderLedCategory | The company’s single gender-leadership category based on its active directors. One of: All women led, Majority women led, Mixed led, Majority men led, All men led, Uncertain. Mutually exclusive — every company falls in exactly one. · CreditSafe-sourced · Related: The Data City guide · Download name: gender_led_category | string | The Data City | Monthly |
WomenLedStats | For each company we have identified the likely founders. We have used their declared title to assign them a gender. We distinguish between founders that have founded a company, but are no longer active at the company and founders that are still active at the company. · CreditSafe-sourced · Related: The Data City guide | womanledstats | The Data City | Monthly |
Group Structure
| Field name | Description | Data type | Source | Updated |
|---|---|---|---|---|
CompanyNation | The company nation. · CreditSafe-sourced | string | Companies House | Monthly |
ParentNation | The nation of the parent company. · CreditSafe-sourced | string | Companies House | Monthly |
UltimateParentNation | The nation of the ultimate parent company. · CreditSafe-sourced | string | Companies House | Monthly |
Job Postings
| Field name | Description | Data type | Source | Updated |
|---|---|---|---|---|
CompanyJobPostings | Job postings data is sourced from Lightcast. Currently this data is not available to all customers. We offer this as an upgrade. Lightcast take individual job postings from multiple sources and classify these jobs into SOC4 codes. SOC stands for Standard Occupational Classifications and is a common classification of occupational information for the UK. We have matched 1.3m Lightcast companies to our 1.7m~ website matched companies. The data represents aggregations for the last two years. We set a minimum count threshold for each category. For each job posting, they also pull keywords related to skills. · Nested object — see CompanyJobPosting | array[companyjobposting] | Lightcast | Quarterly |
CompanyJobPostingsByLot | Job postings data is sourced from Lightcast. Currently this data is not available to all customers. We offer this as an upgrade. Lightcast take individual job postings from multiple sources and classify these jobs into Lightcast Occupational Taxonomy (LOT) codes. We have matched 1.3m Lightcast companies to our 1.7m~ website matched companies. The data represents aggregations for the last two years. We set a minimum count threshold for each category. For each job posting, they also pull keywords related to skills. · Nested object — see CompanyJobPostingByLot | array[companyjobpostingbylot] | Lightcast | Quarterly |
CompanyJobPostingsBySkill | Job postings data is sourced from Lightcast. Currently this data is not available to all customers. We offer this as an upgrade. Lightcast take individual job postings from multiple sources and classify these jobs into SOC4 codes. SOC stands for Standard Occupational Classifications and is a common classification of occupational information for the UK. We have matched 1.3m Lightcast companies to our 1.7m~ website matched companies. The data represents aggregations for the last two years. We set a minimum count threshold for each category. For each job posting, they also pull keywords related to skills. · Nested object — see CompanyJobPostingByCommonSkill | array[companyjobpostingbycommonskill] | Lightcast | Quarterly |
Keywords
| Field name | Description | Data type | Source | Updated |
|---|---|---|---|---|
InnovationKeywords | We have a large list of emerging economy keywords. We loop through the web text to see if the web text contains the keyword. If they do, the keyword is assigned to the company. · Related: Web Scraping guide | array[string] | Web Scraping | Monthly |
ManufacturingKeywords | We have a large list of manufacturing keywords. We loop through the web text to see if the web text contains the keyword. If they do, the keyword is assigned to the company. | array[string] | Web Scraping | Monthly |
SectorKeywords | We have a large list of emerging economy keywords. We loop through the web text to see if the web text contains the keyword. If they do, the keyword is assigned to the company. · Related: Web Scraping guide · Download name: Sectorkeywords | array[string] | Web Scraping | Quarterly |
Other
| Field name | Description | Data type | Source | Updated |
|---|---|---|---|---|
EmailAddresses | Emails are extracted from the website text. Director emails are prioritised. For matching directors to emails: any combination of the complete surname or first name with at least the first initial of the opposing field. No AI is used. · Download name: Email | array[string] | Web Scraping | Monthly/Quarterly |
PhoneNumbers | CreditSafe provide The Data City with CTPS approved phone numbers. Phone numbers are extracted from the website text. · CreditSafe-sourced · Related: CreditSafe guide · Download name: Telephone | array[string] | CreditSafe | Monthly/Quarterly |
BestEstimateEmployees | Employees data is available for most companies. Occasionally there are gaps in reporting. Where there are gaps in reporting, we estimate the number of employees; this includes a projection of employees forwards and backwards. BestEstimateEmployees refers to the current year. BestEstimateEmployees may include overseas employees. · CreditSafe-sourced · Related: The Data City guide · Download name: BestEstimateCurrentEmployees | integer | The Data City | Monthly |
BestEstimateUKEmployees | Similar to BestEstimateEmployees, but estimates only UK-based employees. BestEstimateUKEmployees currently refers to the latest complete year. · CreditSafe-sourced · Related: The Data City guide | integer | The Data City | Monthly |
IsBCorp | A flag of whether a company is a B Corp. This is based on a list of B Corps that we have obtained from B Corp and matched to our companies. · Related: BCorp and The Data City guide | boolean | BCorp and The Data City | Quarterly |
IsLikelySPV | A flag indicating the company is likely a special purpose vehicle (SPV), based on an ML model applied to Companies House data. | boolean | The Data City | Quarterly |
CompanyGrowthTrends | Based on the company growth trends data. · Related: The Data City guide | companygrowthtrend | The Data City | Monthly |
HighGrowth | A boolean value indicating whether the company is a high growth company. · Related: The Data City guide | boolean | The Data City | Monthly |
SimilarCompanies | Our similarity score adopts measures of mathematical similarity to find similar companies using their website text. Companies with website text with the same words are more likely to report a higher level of similarity. · Related: The Data City guide · Nested object — see CompanySimilarity | array[companysimilarity] | The Data City | Monthly/Quarterly |
SimilarCompositeCompanies | Our composite similarity score. · Related: The Data City guide · Nested object — see CompanySimilarity | array[companysimilarity] | The Data City | Monthly/Quarterly |
CompanyExports | HMRC export trade data for the company. · Related: HMRC guide | array[companytrade] | HMRC | Monthly |
CompanyImports | HMRC import trade data for the company. · Related: HMRC guide | array[companytrade] | HMRC | Monthly |
CompanyDescription | Takes the description from the description meta tag from the homepage and the about page. We check this is in English. · Download name: Description | string | Web Scraping | Monthly/Quarterly |
Homepage_domain | Goes through the large web-scraping/matching process. · Related: Web Scraping guide · Download name: URLs | string | Web Scraping | Monthly/Quarterly |
ESGStatementAnchors | This is an array of ESG statement anchors. Each anchor is a string that is found in the company’s website text. · Nested object — see ESGStatementAnchor | array[esgstatementanchor] | The Data City | Monthly |
TonnesOfC02equivGHGPerYear | We estimate greenhouse gas emissions based on a published table of total carbon emissions by SIC sector. We assume that carbon emissions are spread equally across all companies within the SIC sector in proportion to their number of employees. Therefore, we only hold an estimate for a limited number of companies. · Related: The Data City guide | number | The Data City | Monthly |
InnovationScore | We present this on the platform as stars but the API is actually a numerical value which gets categorised. The result is a binary classifier. The company is either innovative or it is not. The confidence increases the higher the score. · Related: The Data City guide | number | The Data City | Quarterly |
LocationDetails | An array of location objects with postcode-related fields. · CreditSafe-sourced · Related: Companies House guide · Nested object — see PostcodeDetail_grouped | array[postcodedetail_grouped] | Companies House | Monthly |
Officers | These are officers as per Companies House. E.g. https://find-and-update.company-information.service.gov.uk/company/10958787/officers · CreditSafe-sourced · Nested object — see Officer | array[officer] | Companies House | Monthly |
URLMatchStats | Based on the URL matching process. | urlmatchstat | The Data City | Monthly/Quarterly |