Introduction and Instructions

This repository enriches existing retail products with a specific data structure,
derived from the description given at the vendor's online store.
The product data is stored locally on your machine under data/google_cloud/products_xy.json.
Each product is matched by the vendor's SKU (stock keeping unit) whenever the vendor's online store has information to scrape.
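
As a rough illustration of the SKU matching described above, the sketch below merges scraped vendor records into local product records. The record layout (dictionaries with `sku` and `description` keys), the output field `vendor_description`, and the function name `match_by_sku` are assumptions made for this example; the actual schema of products_xy.json may differ.

```python
from typing import Any


def match_by_sku(
    products: list[dict[str, Any]],
    scraped: list[dict[str, Any]],
) -> list[dict[str, Any]]:
    """Return copies of `products`, enriched where a scraped record shares the SKU."""
    # Index scraped records by SKU for constant-time lookup (hypothetical "sku" key).
    scraped_by_sku = {
        rec["sku"]: rec for rec in scraped if rec.get("sku") is not None
    }

    enriched = []
    for product in products:
        merged = dict(product)  # copy so the input list is left untouched
        sku = product.get("sku")
        hit = scraped_by_sku.get(sku) if sku is not None else None
        if hit is not None:
            # Hypothetical field name for the scraped description.
            merged["vendor_description"] = hit.get("description")
        enriched.append(merged)
    return enriched
```

Products without a matching scraped record are returned unchanged, which mirrors the case where the vendor's store has nothing to scrape for that SKU.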

(1) Install ChromeDriver, which Selenium uses to drive Chrome.
(2) Clone the repo and change the following path in both files under dependencies/webscraping:
(image: the path to change)

(3) Run main.py and track its progress in the terminal.
(4) When main.py is done, run analyze.py to see how much of the product data has been enriched.
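
To give a sense of what "how much of the product data has been enriched" could mean in step (4), here is a minimal sketch that computes the share of products carrying a non-empty enrichment field. It is not the code of analyze.py; the function name `enrichment_rate` and the field name `vendor_description` are assumptions chosen for illustration.

```python
from typing import Any


def enrichment_rate(
    products: list[dict[str, Any]],
    field: str = "vendor_description",  # hypothetical enrichment field
) -> float:
    """Return the fraction of products whose `field` is present and non-empty."""
    if not products:
        return 0.0  # avoid dividing by zero on an empty data set
    enriched = sum(1 for product in products if product.get(field))
    return enriched / len(products)
```

For example, calling `enrichment_rate` on two products of which one has a description returns 0.5.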

Report

For more details, see docs/report.md.