hariveliki/wdb-mini-challenge-2

Introduction and Instructions

This repository enriches existing retail products with a specific data structure,
built from the product descriptions in the vendor's online store.
The product data is stored locally on your machine under data/google_cloud/products_xy.json.
Each product is matched by the vendor's SKU (stock keeping unit) whenever the vendor's online store has information to scrape for it.
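
The SKU matching described above could look roughly like the following sketch. It is not the repository's actual code: the function name `enrich_products`, the field names `sku` and `enrichment`, and the sample data are all hypothetical and only illustrate the idea of attaching scraped data to a product when its SKU is found.

```python
from typing import Any


def enrich_products(
    products: list[dict[str, Any]],
    scraped_by_sku: dict[str, dict[str, Any]],
) -> list[dict[str, Any]]:
    """Return copies of the products, adding scraped data where the SKU matches."""
    enriched = []
    for product in products:
        result = dict(product)  # shallow copy so the input list is not mutated
        # Hypothetical field name: the real product schema may differ.
        scraped = scraped_by_sku.get(product.get("sku"))
        if scraped is not None:
            result["enrichment"] = scraped
        enriched.append(result)
    return enriched


# Made-up sample data, for illustration only.
products = [{"sku": "A-1", "name": "Kettle"}, {"sku": "B-2", "name": "Toaster"}]
scraped_by_sku = {"A-1": {"power_watts": 2000}}
enriched = enrich_products(products, scraped_by_sku)
```

Products without a matching SKU are passed through unchanged, which mirrors the statement that enrichment only happens when the vendor's store has something to scrape.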

(1) Install ChromeDriver, which Selenium needs to control Chrome.
(2) Then clone the repo and change the following path in both files under dependencies/webscraping:
[screenshot showing the path to change]

(3) Run main.py and track its progress in the terminal.
(4) When main.py has finished, run analyze.py to see how much of the product data has been enriched.
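
As a rough idea of the kind of figure step (4) reports, here is a minimal sketch of an enrichment rate. It does not reproduce analyze.py: the function `enrichment_rate`, the `enrichment` key, and the sample records are assumptions made for illustration.

```python
from typing import Any


def enrichment_rate(products: list[dict[str, Any]]) -> float:
    """Fraction of products that carry non-empty enrichment data."""
    if not products:
        return 0.0
    # Hypothetical key: a product counts as enriched if "enrichment" is non-empty.
    enriched_count = sum(1 for product in products if product.get("enrichment"))
    return enriched_count / len(products)


# Made-up sample records, for illustration only.
sample = [
    {"sku": "A-1", "enrichment": {"power_watts": 2000}},
    {"sku": "B-2"},
    {"sku": "C-3", "enrichment": {}},
    {"sku": "D-4"},
]
rate = enrichment_rate(sample)
print(f"{rate:.0%} of products enriched")
```

An empty `enrichment` dict is treated as not enriched here; whether that is the right rule depends on how the real data structure is filled.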

Report

For more details see docs/report.md

About

Scrape specific websites and enrich products
