Web URLs Were Scraped Daily for Content
The customer, a media consulting company, wanted to structure and analyze large unstructured datasets arising from brand and asset marketing. The goal was to give its partners direction on creative, media, messaging, and communication strategies for digital ad campaigns.
Quantiphi developed an end-to-end platform to ingest multiple data sources into the AWS Cloud and created a data lake on Amazon S3. The sources were combined and transformed using AWS Glue to create an analysis-ready data platform. Quantiphi also developed an automated solution that scrapes content from more than 5M web URLs daily to identify users' content preferences and generate insights on user behavior.
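
To make the scraping step concrete, here is a minimal sketch of what processing a single page could look like. It is not Quantiphi's implementation: the function names (fetch_html, extract_text, top_terms) and the simple word-frequency signal are assumptions chosen for illustration, and a production system handling millions of URLs would also need scheduling, rate limiting, and storage of results in the data lake.

```python
import re
from collections import Counter
from html.parser import HTMLParser
from urllib.request import urlopen


class _TextExtractor(HTMLParser):
    """Collects visible text, skipping <script> and <style> bodies."""

    def __init__(self):
        super().__init__()
        self._skip_depth = 0
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        if self._skip_depth == 0:
            self.chunks.append(data)


def fetch_html(url, timeout=10):
    """Hypothetical helper: download one page and decode it as UTF-8."""
    with urlopen(url, timeout=timeout) as resp:
        return resp.read().decode("utf-8", errors="replace")


def extract_text(html):
    """Return the page's visible text with whitespace normalized."""
    parser = _TextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(" ".join(parser.chunks).split())


def top_terms(text, n=5, min_len=4):
    """Return the n most frequent words of at least min_len letters."""
    words = re.findall(r"[a-z]+", text.lower())
    return Counter(w for w in words if len(w) >= min_len).most_common(n)
```

Calling top_terms(extract_text(fetch_html(url))) for each URL would yield a rough per-page topic signal that could then be aggregated across pages to approximate content preferences.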