Track Changes in Your CSV Data Using Python and Pandas

Written by Asmit Joshi onJanuary 31, 2018

So you’ve set up your online shop with your vendors’ data obtained via Grepsr’s extension, and you’re receiving their inventory listings as a CSV file regularly. Now you need to periodically monitor the data for changes on the vendors’ side — new additions, removals, price changes, etc.

While your website automatically updates all this information when you import from the CSV file, you might sometimes want to see for yourself or display to your customers what changes your vendors have made to their stock.

Let’s take the same example of Teva as in the previous blog, and see how you can easily compare the old and new data sets, and track the changes.

Using this tutorial (Thanks, Chris Moffitt, for the awesome post!) as a guide and making a few modifications, you can set up a project to work with CSV files instead of Excel spreadsheets.

For this blog, I’m assuming you have Python and Pandas packages installed on your system and you’re familiar with at least the basics of programming. Now, you can easily follow along and customize the code to suit your situation.

Data to make or break your business

Get high-priority web data for your business, when you want it.

Get started

Before we start, let’s get our files ready. If you haven’t already, head over to the project on your Grepsr app dashboard and browse through the calendar to see when your crawler was run. When you click on the highlighted dates, you’ll see the time of the crawl on that day, after which you can go to the Download tab below the calendar to download your data for that particular crawl.

If you want the latest data, simply re-run your crawler by going to the Configure & Run tab, and download the file once the web scraping is complete.

scr_calendar — Project crawl times on the Grepsr calendar

All set? Let’s get started!

Step-1: Make life easier by structuring the files

Our first course of action will be to figure out how we can filter unwanted content and create easily manageable files using Python and Pandas.

Our old and new datasets are tevasale_jan10.csv and tevasale_jan26.csv respectively. Here’s a simple code to structure the files:

import pandas as pd

# Reading content from the CSV files
old = pd.read_csv('Teva_files/tevasale_jan10.csv')  
new = pd.read_csv('Teva_files/tevasale_jan26.csv')

# Replacing newlines in the Colors and Sizes columns with " | " as separator
old['Colors'] = old['Colors'].str.replace('n+', ' | ')
new['Colors'] = new['Colors'].str.replace('n+', ' | ')
old['Sizes'] = old['Sizes'].str.replace('n+', ' | ')
new['Sizes'] = new['Sizes'].str.replace('n+', ' | ')

# Removing "Model: " prefix in the Model column
old['Model'] = old['Model'].str.replace('Model: ', '')
new['Model'] = new['Model'].str.replace('Model: ', '')

# Replacing newlines and white-spaces in the Name column with " | " separating the category and name
old['Name'] = old['Name'].str.replace(''s(ns+)', ''s | ')
new['Name'] = new['Name'].str.replace(''s(ns+)', ''s | ')

# Removing empty rows using the Name column as reference
old = old.dropna(subset=['Name']).reset_index(drop=True)
new = new.dropna(subset=['Name']).reset_index(drop=True)

# Writing the structured data to new CSV files
old.to_csv('Teva_files/tevasale_old.csv', index=False)
new.to_csv('Teva_files/tevasale_new.csv', index=False)

Let’s see what our structured file looks like.

scr_csv_file_edited — Data in `tevasale_old.csv`

Data to make or break your business

Get high-priority web data for your business, when you want it.

Get started

Step-2: Find changes in your data and save to a new file

Now that we’ve refined our data, we can proceed with Python to compare two files.

The code for comparing our two CSV files tevasale_old.csv and tevasale_new.csv, and exporting the changes to another CSV file tevasale_changes.csv is as follows:

import pandas as pd

file1 = 'Teva_files/tevasale_old.csv'
file2 = 'Teva_files/tevasale_new.csv'
file3 = 'Teva_files/tevasale_changes.csv'

cols_to_show = ['Model', 'Price', 'Original Price', 'Colors', 'Sizes']

old = pd.read_csv(file1)
new = pd.read_csv(file2)


def report_diff(x):
    return x[0] if x[1] == x[0] else '{0} --> {1}'.format(*x)


old['version'] = 'old'
new['version'] = 'new'

full_set = pd.concat([old, new], ignore_index=True)

changes = full_set.drop_duplicates(subset=cols_to_show, keep='last')

dupe_names = changes.set_index('Name').index.get_duplicates()

dupes = changes[changes['Name'].isin(dupe_names)]

change_new = dupes[(dupes['version'] == 'new')]
change_old = dupes[(dupes['version'] == 'old')]

change_new = change_new.drop(['version'], axis=1)
change_old = change_old.drop(['version'], axis=1)

change_new.set_index('Name', inplace=True)
change_old.set_index('Name', inplace=True)

diff_panel = pd.Panel(dict(df1=change_old, df2=change_new))
diff_output = diff_panel.apply(report_diff, axis=0)

changes['duplicate'] = changes['Name'].isin(dupe_names)
removed_names = changes[(changes['duplicate'] == False) & (changes['version'] == 'old')]
removed_names.set_index('Name', inplace=True)

new_name_set = full_set.drop_duplicates(subset=cols_to_show)

new_name_set['duplicate'] = new_name_set['Name'].isin(dupe_names)

added_names = new_name_set[(new_name_set['duplicate'] == False) & (new_name_set['version'] == 'new')]
added_names.set_index('Name', inplace=True)

df = pd.concat([diff_output, removed_names, added_names], keys=('changed', 'removed', 'added'))
df[cols_to_show].to_csv(file3)

Let’s see what we’ve done here with the help of Python and its Pandas package:

Firstly, we’ve read our files into separate data frames old and new.
Created a report_diff function to account for the changes between the files — it prints old and new values wherever a change has been made.
Added a version column to both data frames to note the origin of each row when we later combine them.
Combined the contents of the two data frames and stored them in another data frame full_set.
Removed duplicate rows, i.e. unchanged data, from full_set and stored the remaining data in changes.
Used the get_duplicates() function to get a list of all names that are duplicated. We named the list dupe_names.
Using isin, got a list of all duplicates, dupes.
Split dupes based on version to two new data frames change_old and change_new.
Removed the version column.
Set Name as our index for both data frames.
Into diff_output we called our report_diff function, and stored the rows where data has been changed.
Then we found out which item is removed from stock and saved it to removed_names.
Now to find all new items, we checked for duplicates again, and filtered each row based on the item’s uniqueness AND presence in the ‘new’ data frame. This list was then saved as added_names.
Finally we merged the three data frames with keys to differentiate the type of change — changed, removed or added — and we’ve written everything into a new CSV file.

At Last

Our final CSV file tevasale_changes.csv looks something like this:

scr_csv_file_changes — All changes after comparing the old and new CSV files

We can clearly observe additions, removals, and changes in details for each item.

Although the dataset used here was relatively small (~70 items in each file), the code still works for much larger data.

This is a helpful tool to track what changes your vendors have made to their stock. Hence you can easily implement them on your website and give your customers up-to-date and accurate information.

Once again, a huge thanks and gratitude to Chris Moffitt, on whose tutorial the codes are based.

Related reads:

Web Scraping with Python: A How-To Guide

Learn how to write a crawler in Python to extract data of the top IMDB movies of all time. Find out key differences between web scraping with PHP vs Python.

Web Scraping with PHP – A How-To Guide

PHP, although used extensively on the web, is dreaded by many. Read this article to perform web scraping with PHP. Bonus: we’ve addressed common mistakes.

Flexible pricing models that suit your enterprise needs

Plans & Pricing

data analysis, productivity, python and pandas, web scraping

BLOG

A collection of articles, announcements and updates from Grepsr

Articles | Knowledge Base | Use Cases April 24, 2024

3 Pillars of a Powerful Data Strategy + Real-Life Examples (2024)

By the time you’re done reading this post, human activity on the web and across devices will generate 27.3 million terabytes of data. According to Bernard Marr, author of Data Strategy, in the 21st century, “every business is a data business.” What information do you want to collect? Where are you going to store the […]

Article | Knowledge Base April 18, 2024

Data Vs Information. Learn Key Differences

Did you know that Netflix – the biggest online streaming service that produces and releases top movies and TV shows (you know, Stranger Things & Squid Game) owes its success to Big Data? Their customer retention rate is 93%, the highest benchmark in the industry. Surely, you’ve glimpsed the term “Big Data” thrown in some […]

Analytics | Articles | Knowledge Base | Use Cases April 8, 2024

RPA is a Replicator: An Organizational Tour De Force

Richard Dawkins’ concept of the “replicator” in his book “The Selfish Gene” provides a fascinating lens through which we can view the rise of Robotic Process Automation (RPA). In the book, Dawkins argues that genes, not organisms, are the true “replicators” in evolution. These self-replicating molecules carry the instructions for building and maintaining life. They […]

Analytics | Article | Articles | Knowledge Base | Use Cases March 27, 2024

How Walmart’s Data Insights Can Power Your Retail Strategy

What do we know about Walmart? We know it’s the largest retailer in the world by revenue, with the company’s global sales crossing $600 billion. We also know that the company has the world’s largest private cloud-based database – Data Café. And finally, it hires the maximum number of data scientists to leverage Big Data. […]

Article March 22, 2024

Common Challenges in Web Scraping and Their Solutions Using RPA

What comes to your mind when I say think of a detective? A sharp mind, a piercing gaze that misses nothing, a sharp long nose, a smoke pipe always resting in his mouth, and a relentless pursuit of truth. A man who stands out for his outstanding investigation skills. Yes, you’re right. It’s Sherlock Holmes! […]

Article | Knowledge Base | Use Cases March 14, 2024

Web Scraping Zillow: A Modern Approach to Real Estate

What comes to mind when we say the word ‘real estate’? Are you thinking of a broker dressed in a pantsuit, with shiny white teeth, walking across a manicured lawn? Or the smell of warm cookies wafting in from an open house with a ‘For Sale’ sign planted in the grass? For decades, buying and […]

Article March 12, 2024

Popular ETL Tools for Web Scraping

Learn about the most popular ETL tools in this blog. Ever felt like you’re searching for a specific detail buried deep within a massive website? That’s the essence of web scraping! And if you’re familiar with finding the needle in a haystack, you’ll understand the challenge. Web Scraping is essential and you must do it. […]

Article March 7, 2024

Transforming Operations: RPA and Web Scraping in Action

Imagine a world where you no longer have to do the repetitive grunt work that neither sparks joy nor creativity. It completely vanishes from your sight as you have digital robots that tirelessly do structural tasks following a regular pattern without any turmoil. As a result, you are released from the shackles of mundane labor. […]

Article March 6, 2024

Mine Reddit’s Billions of Opinions: Web Scraping Reddit and Sentiment Analysis (2024)

In January 2024 alone, there were 7.57 billion visits to Reddit. There are 2.8 million subreddits with discussions on everything imaginable — from r/cats to r/memes and one of our personal favorites, r/dataisbeautiful. These numbers in billions and millions are indicative of Reddit as one of the largest online communities in the world; which makes […]

Explainer | Knowledge Base March 1, 2024

ETL for Web Scraping – A Comprehensive Guide

Dive into the world of web scraping, and data, learn how ETL helps you transform raw data into actionable insights.

Article February 16, 2024

Web Scraping Best Practices for RPA Integration

The new era of RPA- a shift from manual hard work to automated smart work in business. RPA is the process of automating routine and repetitive tasks in business operations. Robotic Process Automation uses technology that is steered by business logic and structured inputs. People might mistake it for a robot doing their mundane jobs […]

Articles | Knowledge Base | Use Cases February 2, 2024

Introduction to Web Scraping & RPA

Web scraping automatically extracts structured data like prices, product details, or social media metrics from websites. Robotic Process Automation (RPA) focuses on automating routine and repetitive tasks like data entry, report generation, or file management. When seamlessly integrated through tools like webhooks or API calls, these technologies can significantly boost an organization’s operational efficiency by […]

Article February 1, 2024

Quantitative Data: Definition, Types, Collection & Analysis

Data is ubiquitous and plays a vital role in helping us understand the world we live in. Quantitative data, in particular, helps us make sense of our daily experiences. Whether it’s the time we wake up in the morning to get to work, the distance we travel to get back home, the speed of our […]

Article January 22, 2024

Extract Google Trends Data by Web Scraping

Approximately 99,000 search queries are processed by Google every passing second. This translates to 8.5 billion searches per day and 2 trillion global searches per year. From the estimated data, we can consider that an average person conducts between three to four searches every day. “Explore what the world is searching” – Google Trends. The […]

Article December 28, 2023

Blog Scraping: Uncover Opportunities for Data-Driven Growth

A study by HubSpot marketing shows that those businesses who publish blogs get 55% more website visitors, 77% more inbound links, and 434% more indexed pages than those who don’t. The ultimate goal of any business is to continually increase its lead conversion rate. Content is essentially what leads the organization to bring more leads […]

Article December 14, 2023

Relevance of Web Scraping in the Age of AI

Artificial Intelligence (AI) has flourished into a rapidly evolving domain of computer systems that can function perfectly in tasks that need human intelligence. Statistics claim that the market volume for AI is projected to reach $738.80 billion by 2030. This essentially means that there is a growing demand for AI-related services, leading to an expansion […]

Article December 11, 2023

ETL Data and Web Scraping Brilliance

Did you know that in a world drowning in information, making sense of raw data from the internet is like finding a needle in a haystack? However, looking at the silver lining, the dynamic duo – ETL and web scraping can unravel the chaos of unlimited, unstructured data into clarity and make sense. ETL is […]

Article November 23, 2023

Buy Box Data: What Every Seller Needs to Know

Did you know, winning the Buy Box can increase your chances of becoming an Amazon best-seller? The Buy Box accounts for 90% of the total sales on the platform, making it crucial for sellers to leverage the Buy Box data. Amazon is at the helm of the overdrive in the e-commerce industry. Living proof of […]

Article November 9, 2023

Boosting Business Intelligence with Managed Data Extraction

Did you know that Lotte, a South Korean conglomerate increased their sales up to $10 million thanks to Business Intelligence? Business Intelligence is the process of collecting, analyzing, and presenting raw data that is transformed into meaningful insights. It involves methodologies that ultimately aid the business in making strategic and actionable data-driven decisions. For a […]

Article November 1, 2023

E-commerce in Overdrive: Unleash the Power of Cyber Monday

In 2022, Cyber Monday accomplished a remarkable feat, propelling e-commerce sales to an impressive $11.3 billion—an extraordinary 5.8% increase, setting a new benchmark for online shopping. As the holiday season approaches, the global culture of bestowing gifts and celebration is also at an all-time high. For these times to be extra special, people look for […]

Article October 19, 2023

Holiday Fleet Management: A Roadmap to Data-Driven Success in Car Rentals

In today’s car rental industry, data isn’t just an option; it’s the key to making pivotal decisions that drive success. The car rental industry is poised for a lucrative path ahead, with a projected revenue surge to $1.9 billion by 2027. The holiday season ignites a desire to explore and experience new places, which, in […]

Article October 6, 2023

The Simplicity of Employing No-Code Web Scraping

Unlock the Power of No-Code Web Scraping: Transform Your Business with Data-Driven Success. Learn how web scraping and external data providers can revolutionize your industry. Explore real-world examples and discover the simplicity of harnessing valuable data.

Article September 20, 2023

Drive Success with Car Rental Data Extraction

Tap into the capabilities of car rental data extraction with Grepsr. Outperform competitors, fine-tune fleet management, and just do more.

Articles | Knowledge Base September 14, 2023

The Power of Web Scraping: Enriching POI Datasets

Discover how web scraping is revolutionizing the extraction and enrichment of POI data, ensuring accuracy and timeliness

Article September 2, 2023

Customer Sentiment Analysis and the Role of Web Scraping

Web scraping is indispensable for any Customer Sentiment Analysis Project. Learn how you can leverage web scraping to your advantage.

Articles September 1, 2023

Mastering Data Visualization in Python with Grepsr’s Data

In a world where data reigns supreme, the ability to make sense of the overwhelming volume of information is nothing short of a superpower. Harnessing the power of data visualization in Python is a superpower in itself. From interactive charts and graphs to immersive dashboards, visualization helps businesses and individuals gain insights from data. But […]

Article | Explainer | Knowledge Base August 18, 2023

Extracting Data from Websites to Excel: Web Scraping to Excel

Web scraping and Excel go hand in hand. After extracting the data from the web, you can then organize this data in Excel to capture actionable insights. The internet, by far, is the biggest source of information and data. Juggling through multiple sites to analyze data can be quite irksome. If you are analyzing vast […]

Articles | Featured July 30, 2023

Five Reasons Why You Need an External Data Provider

Web data extraction of large datasets is almost impossible with in-house capabilities. Learn why you need an external data provider.

Articles | Featured | Knowledge Base | Use Cases July 24, 2023

Analyzing US Job Postings Data to Understand Job Market & Economy

Leveraging one of Grepsr’s job postings data projects to gather insights — the hottest industries and employers, including working conditions

Articles July 20, 2023

Web Scraping for Lead Generation: Open a Portal to Sales

Reaching out to leads and converting them into customers doesn’t have to be a shot in the dark. Web scraping can help you get access to high-quality leads databases and scale your lead generation process.

Articles | Featured June 22, 2023

Web Scraping: An Unlikely Data Solution

Data has now become something of a currency in the twenty-first century. But, when you think of data, does web scraping come to your mind? We’re here to tell you it should.

Articles May 24, 2023

Zero-in on Your Real Estate Prospects with Data

Big Data technologies make real estate prospecting more credible and effective by giving you access to real-time web data. You can use web scraping to gather actionable web data and analyze the real estate market environment on a city block level.

Explainer | Knowledge Base April 28, 2023

Web Scraping with Python: A How-To Guide

Most businesses (and people) today more or less understand the implications of data on their business. ERP systems enable companies to crunch their internal data and make decisions accordingly. Which would have been enough by and itself if the creation of web data did not rise exponentially as we speak. Some sources estimate it to […]

Explainer | Knowledge Base February 28, 2023

How to Perform Web Scraping with PHP

In this tutorial, you will learn what web scraping is and how you can do it using PHP. We will extract the top 250 highest-rated IMDB movies using PHP. By the end of this article, you will have sound knowledge to perform web scraping with PHP and understand the limitation of large-scale data acquisition and […]

Articles January 17, 2023

Why Data Extraction Services are Better Than Tools for Enterprises

The key factors that set a data extraction service apart from its do-it-yourself variant

Announcements | Press Release December 2, 2022

Press Release: Grepsr joins Data Commerce Cloud (DCC) to meet global need for actionable, on-demand DaaS solutions

Dubai, UAE / Berlin, Germany. 1 December 2022 – Grepsr, provider of custom web-scraped data, has become a Premium Partner of Datarade’s Data Commerce Cloud™, the platform which makes data commerce easy. Grepsr’s data products are now available to buy on Datarade Marketplace and other DCC sales channels. Grepsr processes 500M+ records, parses 10K+ web sources, and extracts data […]

Articles November 21, 2022

Screen Scraping: 4 Important Questions for Scoping your Web Project

Screen scraping should be easy. Often, however, it’s not. If you’ve ever used a data extraction software and then spent an hour learning/configuring XPaths and RegEx, you know how annoying web scraping can get. Even if you do manage to pull the data, it takes way more time to structure it than to make the […]

Articles | Explainer | Knowledge Base January 24, 2022

Significance of Big Data in the Tourism Industry

In a post-pandemic reality, big data helps travel agents and travelers make better decisions, minimize risks, and still have memorable holidays.

Articles | Year in Review January 4, 2022

Grepsr’s 2021 — A Year in Review

Our growth and achievements of the past year, and reasons to get excited in 2022

Articles October 22, 2021

Data Analysis: Five Steps to Superior Data

This is one piece of a three-part series that looks at the various data analysis methods, techniques, and essential steps to ensure its superiority. According to Wikipedia, data analysis is a process within data science of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful insights, informing conclusions, and supporting decision-making. Data […]

Articles October 18, 2021

Qualitative and Quantitative Data Analysis Methods

This is one piece of a three-part series that looks at the various methods, techniques, and essential steps to ensure superior data analysis. The majority of leaders from high-performing businesses attribute their success to data analytics. According to a survey done by McKinsey & Company, respondents from these companies are three times more likely to […]

Articles September 28, 2021

Make Data Make Sense: Most-Used Techniques in Data Analysis

This is one piece of a three-part series that looks at the various methods, techniques, and essential steps to superior data analysis.

Articles September 10, 2021

A Smarter MO for Data-Driven Businesses

Data is key to future-proofing your brand. Web scraping is the first step towards achieving long-term data-driven business success.

Articles August 26, 2021

Business Data Analytics — Why Enterprises Need It

Objectivity vs subjectivity The stories we hear as children have a way of mirroring the realities of everyday existence, unlike many things we experience as adults. An old folk tale from India is one of those stories. It goes something like this: A group of blind men goes to an elephant to find out its […]

Articles | Featured August 11, 2021

Perfecting the 1:10:100 Rule in Data Quality

Never let bad data hurt your brand reputation again — get Grepsr’s expertise to ensure the highest data quality

Articles July 21, 2021

Data Visualization Is Critical to Your Business — Here Are 5 Reasons Why

Data visualization is a powerful tool. When done correctly, it is a much more elegant method of explaining even complex concepts compared to lengthy texts and paragraphs. Maps and graphs have existed since the 17th century as a means of visualizing data. It was in the mid-1800s that the world saw one the first examples […]

Articles | Knowledge Base July 2, 2021

What is Data Normalization & Why Enterprises Need it

In the current era of big data, every successful business collects and analyzes vast amounts of data on a daily basis. All of their major decisions are based on the insights gathered from this analysis, for which quality data is the foundation. One of the most important characteristics of quality data is its consistency, which […]

Articles | Featured June 16, 2021

Benefits of Using Web Scraping to Extract Airfare Data from OTAs

Use web scraping to extract airfare data from OTAs and airlines’ websites to give your customers the best possible start to their holiday experience.

Articles | Featured May 17, 2021

Legality of Web Scraping — An Overview

Ever since the invention of the World Wide Web, web scraping has been one of its most integral facets. It is how search engines are able to gather and display hundreds of thousands of results instantaneously. And also how companies build databases, develop marketing strategies, generate leads, and so on. While its potentials are immense, […]

Explainer | Knowledge Base May 4, 2021

Image Scraping — What is It & How is It Done?

From retail and real estate to tourism and hospitality, images play a vital role in influencing customer decisions. Hence, it is important for brands to see what kinds of photos are turning prospects into customers. On the other side, customers go through numerous products and images before settling on a final choice. Similarly, analysts browse […]

Articles | Featured April 26, 2021

Data Scraping from Alternate Sources — PDF, XML & JSON

An unconventional format — PDF, XML or JSON — is just as important a data source as a web page.

Announcements | Featured | Knowledge Base | Product April 16, 2021

QA at Grepsr — How We Ensure Highest Quality Data

Ever since our founding, Grepsr has strived to become the go-to solution for the highest quality service in the data extraction business. In addition to the highly responsive and easy-to-communicate customer service, we pride ourselves in being able to offer the most reliable and quality data, at scale and on time, every single time. QA […]

Articles April 6, 2021

Benefits of High Quality Data to Any Data-Driven Business

From increased revenue to better customer relations, high quality data is key to your organization’s growth.

Articles March 26, 2021

Five Primary Characteristics of High-Quality Data

Big data is at the foundation of all the megatrends that are happening today. Chris Lynch, American writer More businesses worldwide in recent years are charting their course based on what data is telling them. With such reliance, it is imperative that the data you’re working with is of the highest quality. Grepsr provides data […]

Articles | Knowledge Base March 2, 2021

11 Most Common Myths About Data Scraping Debunked

Data scraping is the technological process of extracting available web data in a structured format. More businesses globally are realizing the usefulness and potential of big data, and migrating towards data-driven decision-making. As a result, there’s been a huge rise in demand in recent years for tools and services offering data for businesses via Data […]

Articles February 23, 2021

Common Challenges During Amazon Data Collection

Over the last twenty years, Amazon has established itself as the world’s largest ecommerce platform having started out as a humble online bookstore. With its presence and influence increasing in more countries, there’s huge demands for its inventory data from various industry verticals. Almost all of the time, this data is acquired via web scraping […]

Analytics | Articles February 12, 2021

Customer Review Insights: Analyzing Buyer Sentiments of Amazon Products

Actionable insights from Amazon reviews for better decision-making

Articles | Year in Review January 12, 2021

A Look Back at Grepsr’s 2020

A brief look at Grepsr's achievements in data extraction and industry reach in 2020, and a glimpse into 2021 plans.

Announcements | Product | Updates July 31, 2020

Our Newly Redesigned Website is Live!

We’ve redesigned our website to make it easier for you to find what you’re looking for

Announcements | Updates March 31, 2020

Preview the New Look Grepsr App

Everybody’s favorite big data tool is getting a fresh coat of paint (and some behind-the-scenes tweaks)

Articles March 11, 2020

Role of Data Mining During the COVID-19 Outbreak

How web scraping and data mining can help predict, track and contain current and future disease outbreaks

Articles | Year in Review January 3, 2020

Grepsr’s 2019 — A Year (and Decade) in Review

Time flies when you’re having fun

Announcements | Feature | Featured November 11, 2019

Introducing Grepsr’s New Slack-like Support

Making our data acquisition specialists more accessible to busy professionals

Announcements | Feature | Knowledge Base June 18, 2019

Introducing Grepsr’s Data Quality Report

Quality assured data to help you make the best business decisions

Knowledge Base | Video Tutorials January 30, 2019

Report History/Activity on the Grepsr App

A walk-through detailing your report history and how to access (and download) your report’s data from historic crawl runs

Articles | Year in Review January 3, 2019

Grepsr’s 2018 — A Year in Review

As we say hello to 2019, everyone here at Grepsr firstly wishes our readers and valued customers a very Happy New Year! We look forward to your continued love and support in the new year and beyond. Here’s a look back at some of Grepsr’s highlights in 2018. New Product In addition to our existing […]

Announcements | Knowledge Base | Policy November 1, 2018

Data Retention in Grepsr

New policy announcement

Knowledge Base | Video Tutorials September 6, 2018