Table of contentsDocumentation Index
Fetch the complete documentation index at: https://docs.thedatacity.com/llms.txt
Use this file to discover all available pages before exploring further.
Overview
ML lists allow you to build and classify company in a matter of minutes, using our AI tech. To build a machine learning list you will need to train the algorithm to capture companies you are interested in and exclude the rest. This is an iterative process: you will build your lists in stages until you get the desired output. To start building your list you will need:- A set of companies that you know are good representatives of the industry
- A set of keywords that define the activity of the companies in the industry.
Step 1 - Getting started
To get started, head to MY LISTS and click ‘Create a New List’. Add the first example companies using one or both of the methods shown in the video. You can either insert company numbers or use the keyword search engine to find good examples.Video to re-embed. This guide originally embedded a HubSpot video (id 152442612400). Add an updated embed link in this article.

Step 2 - Training your list
Start training your list to exclude companies that you don’t want in the list by adding these to the negative training set by clicking the “x” button. You can use the keyword filter or the score value to identify these companies - companies at the end of the list will have lower score values.Video to re-embed. This guide originally embedded a HubSpot video (id 152447307109). Add an updated embed link in this article.
Step 3 - Finalising your list
Repeat this process until you get a representative list. You can add more companies to your positive training set by clicking the ”+” button. Important considerations:- The companies that you add to your training sets lead the classification, not the keywords that you use to filter the list.
- The keywords that you use to filter the list are helpful to find companies for the training sets.
- Once the algorithm has generated a list, you can manually add or remove companies. This is useful when you want to add a company without impacting the training sets or the classifier terms, or when a company does not have an available website.
Video to re-embed. This guide originally embedded a HubSpot video (id 152473222933). Add an updated embed link in this article.
- Check the big companies and manually exclude them if they are not relevant. Big multinational companies may use relevant language on their website, but might not be a good representative of the sector in question. This is important because they will skew the results for the overall performance of the sector in subsequent analysis if left in.