In total, there are over 500.000 websites which are under the magnifying glass, with 24,000 being added each month.
EMB allows you to gain the following information for company websites:
Observation: an alarming situation according to Pricewaterhouse Cooper
There are many data mining solutions and very few of them are efficient. EMB has developed its own, which is made to measure, and reflects the quality required.
This solution is designed to recognise official data but much more besides. Verifying data – such as details of a company’s headquarters, an activity or telephone number -, needs context to be provided for the information being extracted and it also needs to provide more than is strictly necessary.
The more you harvest and structure the information, the more markers you will have to verify the data that is ultimately being targeted.
From its dedicated servers, with its specific algorithms, EMB collects billions of items of data each day and structures them in its database. EMB also has the ability to modify its algorithms at any time to identify new pieces of data.
How can you find out if a company practices e-commerce?
In B-to-B, for example, when there is no online payment where can one identify oneself on a client extranet?
Our algorithm scans the sites in search of over 90 markers for this single piece of information: secure pages, redirection towards payment solution, search for specific key words on the pages or URLs, etc.
During the qualification phase, these 90 markers verify whether there is e-commerce, what type, what payment method, etc.
This application has been developed especially and operates outside of the web. It is structured around powerful algorithms and exists to:
LINKS IS INSPIRED BY THE MOST ADVANCED QUANTITIVE AND QUALITATIVE DATA ANALYSIS METHODOLOGIES.
For telephone numbers, all numbers are formatted according to a single format of the type 00331271254960.
The application will:
Among the email addresses extracted from a site, the application “Links” is capable of:
AS PART OF THIS PROCESS, A CONFIDENCE LEVEL IS ATTACHED TO EACH ITEM OF DATA
Only data that is judged to be trustworthy via an indicator system will be approved.
Data that is irregular, subject to risk or conflict will either be ruled out immediately or will undergo subsequent checks.
Any data that is missing or that does not reach the required quality coefficient will be required to undergo a further stage of certification.
It shall be subject to a human, manual check.
This stage is carried out in partnership with a number of approved companies.
Our data is regularly used by our clients, on average 5 to 6 times per month.
We also subject to monthly monitoring (see Step #1).
This means we are able to guarantee frequent updates.
There is no other Web Data Management company with expertise than can rival ours, nor with methods that are as tried and tested.
DATA ANALYSIS IS AN ONGOING, STANDARDISED PROCESS, DIVIDED UP INTO VARIOUS PHASES.
Our bases evolve constantly and are always being updated.
CAN WE GUARANTEE 100% QUALITY? NO… BUT 99.99%... ABSOLUTELY!