We use website text data to classify companies into RTICs. Not all companies on the platform have an RTIC associated with them. These are the most common reasons as to why:Documentation Index
Fetch the complete documentation index at: https://docs.thedatacity.com/llms.txt
Use this file to discover all available pages before exploring further.
- We have not yet matched a company in Companies House with a website. We can only obtain text data after successfully linking a company from Companies House to its website. If we do not have website text data for a company, we won’t be able to classify it into an RTIC. We have >5.4 million individual companies on the platform, of which >1.6m have a URL match.
- We built an RTIC before a company was founded or changed its website text. If we build an RTIC before the foundation of a company or a company’s change of processes/technologies, it won’t be in the RTIC until the next iteration.
- Some companies do not work in the emergent sectors we target. This especially applies to companies working in foundational industries.
- Some companies may use relevant processes or technologies but the way they describe them is not similar enough to the companies selected as training data.
- They are false negatives and we missed them. Our RTICs have a minimum confidence level of 90%. We acknowledge some companies’ classification falls through the cracks. When we use the term “false negative” we mean the algorithm classified a company as not being relevant for the sector when it should be. This is regularly fixed in later iterations of RTIC building and with the help of industry experts and our users.
Video to re-embed. This guide originally embedded a HubSpot video (id 152318682193). Add an updated embed link in this article.
RTICs: Find out more about RTICs and how they are built in our What are RTICs? guide.