In all our POI Data Packs information is organised by a unique identifier used to recognise the point of interest: the poi_id
field. This field is used as a foreign key to link the different files and/or packages together.
As a customer you might want to complement our information with some data you already own. Provided that you have a list of VAT Identification Numbers we can provide a service to match your list to our poi_id
field. In this way you can then establish a clear link between our information and your pre-existing data.
Technical information
Input file
In order to make this match information we do need an input file from your side. This file must be a CSV file with the following requirements:
- UTF-8 encoding
- New line delimiter:
LF
- Field delimiter: comma
,
- Escape character: double quotes
“
We need the file to contain TINs list as outlined below:
Field Name | Description | Mandatory | Type |
vat_id | VAT Identification Number. Currently supported VATs are:
VAT number (Italy) - 11 digits
Tax ID (Italy) - 16 digits
CNPJ (Brazil) - 14-digit number formatted as XX.XXX.XXX/0001-XX | YES | STRING |
Output file
As output of this process you will receive a file called VAT Verified Association with fields as defined below:
Field Name | Description | Type |
vat_id | VAT Identification Number value given as input. | STRING |
poi_id | Our own unique POI identifier.
If no match is detected this field is NULL | STRING |
confidence | A number between 1 and 100 to express our level of confidence in matching the provided VAT with the returned poi_id
The higher this number the higher the chance the match is correct.
If no match is detected this field is NULL
For additional details please refer to FAQs section. | DOUBLE |
🧐 FAQs
The confidence level in our VAT Matcher system is calculated using a comprehensive set of matching rules. These rules are designed to accurately attribute a VAT number to a point of interest by analyzing various data points such as business name, category, phone number, and others. If all these matching rules are met, the system assigns a 100% confidence level, indicating a perfect match. Conversely, if none of the rules are met, the confidence level is set at 0%, reflecting a lack of match. This method ensures a precise and reliable VAT identification process by evaluating each potential match against a robust and detailed rule set.
In our VAT Matcher, we have set a minimum acceptable confidence threshold at 80%. This threshold is established to ensure a high level of accuracy and reliability in the matches we provide. It represents a balanced approach between maintaining a high standard of match quality and capturing a broad range of potential valid identifications. Matches that meet or exceed this 80% threshold are considered to have a high level of confidence and are therefore included in our outputs.
In the current version of the VAT Matcher, our approach is to include only those matches that have a confidence level above the 80% threshold. This policy is based on our commitment to maintain high standards of accuracy and reliability in the data we provide. By focusing on matches that exceed this threshold, we ensure that our users receive only the most reliable and trustworthy matches. This approach helps in filtering out less certain matches, thereby enhancing the overall quality and dependability of our VAT matching service.