Overview

- Pearson VUE is one of the largest education and book publishing company in the world.
- They use datasets acquired by Grepsr to measure and drive insights.
- They need to evaluate and compare content across hundreds of thousands of websites on a recurring basis.
- Their in-housing data processing was inefficient and extremely limited to meet increasing demands.
- Grepsr helps them automate routine extractions, massive improving turnaround times and scale operations throughout their regional offices.
Challenges
Although Pearson VUE authorized test centers will say they recognize test A or test B and they will put that on their websites. However, to find and assess the level at which these programs are recognized and listed – as new centers are added and the structure of webpages changes is not easy.
The scope to analyze is vast and complex. There are over 5000 of Pearson VUE test centers websites with hundreds of thousands of webpages, sub-paragraphs, image to scan, compare and evaluate on a frequent and recurring basis. In addition, many test centers websites’ displays required information only upon search, interrelationships and links that also change unpredictably.
Pearson VUE’s Director of International Client Relations, Charles Hamilton, “translates” this information to build targets for his team to drive further penetration and adoption of their test programs. The challenge for Charles is to quickly sift through the overwhelming volume of content to deliver targeted information and timely insights.
“We needed a benchmark that we could use as a YOY indicator to assess our team’s performance and get listed on these institution websites. It was a business performance metric that we were really missing!”
5X
600K+
500+
Solutions
Before using Grepsr, Pearson VUE’s client relations team spent many hours every day going through test centers’ websites, scanning through thousands of webpages, downloading brochures, and classifying and storing hundreds of thousands of metadata for analysis, but they are all happening in the wrong places and are siloed. The process was inefficient, inconsistent, and not scalable to meet demand and left them with little time to evaluate relevancy and provide new services.
Pearson VUE uses Grepsr to automate the manually-intensive data collection, and pre-business development processes – more than 500 automated information flows in total. This has freed both Charles and his teams’ time for deeper analysis and other domain-related tasks. Now Charles and his team are able to track their performance quicker, and more efficiently, accurately and consistently.
Using Grepsr to automate the error-prone manual metadata collection process significantly reduced the overall turnaround time by over 5 times. This also enabled Charles to establish a business performance metric that allowed him to quickly build targets for his teams in different countries, conduct annual assessment of their performance, and feed that into their bonus and incentives.

Similar challenges faced across the industry:
Lack of technical know-how to automate routine data extractions
Businesses need fresh data to gather the best insights. To that end, one or two data extractions a day does not suffice. They need a system that can easily schedule crawl runs at specific intervals as well as on demand.
Lack of resources - time, money and personnel - for data sourcing at scale
Data extraction is extremely tedious and highly error-prone. Most businesses lack the infrastructure to perform high volumes of data sourcing, and at a quality that yields the best results.
Overcoming source website’s restrictions
Most websites place limits on how many requests can be made in a set time period, and regularly block bots from accessing their content.
Getting started with Grepsr
Start with Grepsr in a few easy steps. Leave the data sourcing heavy lifting to us, so you can focus on innovation and growth.
Initial project consultation
Instrument web crawlers
Begin data collection
Hassle-free maintenance
Data acquisition doesn’t have to be a burden
Offload your routine data extractions on our able hands, and focus on more pressing matters.