This tool digs through Cdiscount product pages and pulls back clean, structured data you can actually use. It solves the messy problem of collecting specs, pricing, ratings, and media from one of France’s largest marketplaces. If you're trying to understand products at scale, this scraper makes the process simple.
Created by Bitbash, built to showcase our approach to Scraping and Automation!
If you are looking for Cdiscount Product Details Scraper you've just found your team — Let’s Chat. 👆👆
The scraper collects detailed product information from Cdiscount pages and turns it into structured JSON. It helps anyone who needs reliable product intelligence without the hassle of navigating countless pages manually. It’s especially handy for analysts, ecommerce teams, and anyone keeping an eye on competitive trends.
- Handles the complex layout of Cdiscount product pages reliably.
- Captures deep technical specs, pricing, availability, and customer rating data.
- Supports bulk URL scraping with retry logic for unstable pages.
- Works smoothly with proxy configurations to avoid interruptions.
- Outputs structured fields ready for databases, dashboards, or pricing tools.
| Feature | Description |
|---|---|
| Product detail extraction | Collects product names, descriptions, pricing, brand, and availability. |
| Technical specification parsing | Extracts structured technical fields grouped by category. |
| Media collection | Retrieves all product image URLs, including main and secondary images. |
| Rating insights | Captures rating count and overall score for quality assessment. |
| Flexible input | Accepts multiple URLs with custom retry rules and proxy setup. |
| Export-ready output | Returns clean JSON usable in BI tools, apps, and pipelines. |
| Field Name | Field Description |
|---|---|
| id | Internal product identifier extracted from page structure. |
| url | Product page address used during scraping. |
| name | Full product title shown on the product page. |
| brand | Brand or manufacturer associated with the product. |
| sku | Stock-keeping identifier used to track variations. |
| gtin | Global barcode or trade item number when available. |
| product_overview | Short summarization of the product. |
| description | Full extended product description. |
| image_urls | List of image objects including URLs and main image flag. |
| availability | Stock status such as InStock or OutOfStock. |
| category | The product category as defined on-site. |
| price | Extracted price value in numeric format. |
| currency | Currency code, typically EUR. |
| total_ratings | Number of submitted customer reviews. |
| rating_score | Average customer score. |
| technique_detail | Structured specification blocks with categories and key–value details. |
[
{
"id": "bunsam55au6905",
"url": "https://www.cdiscount.com/high-tech/televiseurs/samsung-55au6905-tv-led-55-138-cm-4k-uhd-38/f-1062613-bunsam55au6905.html",
"name": "Samsung 55AU6905 - TV LED 55\" (138 cm) - 4K UHD 3840x2160 - HDR - Smart TV - Gaming HUB - 3xHDMI - WiFi",
"brand": "SAMSUNG",
"sku": "bunsam55au6905",
"gtin": "",
"product_overview": "TV LED - SAMSUNG - 55AU6905 - 55\" (138 cm) - 4K UHD 3840x2160 - HDR - Smart TV - 3xHDMI - WiFi",
"description": "TV LED - SAMSUNG - 55AU6905 - 55\" (138 cm) - 4K UHD 3840x2160 - HDR - Smart TV - 3xHDMI - WiFi",
"image_urls": [
{ "url": "https://www.cdiscount.com/pdt2/9/0/5/5/550x550/bunsam55au6905/rw/samsung.jpg", "is_main": false },
{ "url": "https://www.cdiscount.com/pdt2/9/0/5/1/550x550/bunsam55au6905/rw/samsung-main.jpg", "is_main": true }
],
"availability": "InStock",
"category": "Téléviseur LED",
"price": 399.99,
"currency": "EUR",
"total_ratings": 314,
"rating_score": 4.3
}
]
Cdiscount Product Details Scraper/
├── src/
│ ├── main.py
│ ├── scraper/
│ │ ├── parser.py
│ │ ├── fetcher.py
│ │ └── utils.py
│ ├── config/
│ │ └── settings.example.json
│ └── outputs/
│ └── writer.py
├── data/
│ ├── sample-input.json
│ └── sample-output.json
├── requirements.txt
└── README.md
- Retailers track competitor pricing so they can adjust catalogs faster and stay competitive.
- Market researchers study product trends and specs across categories to sharpen insights.
- Ecommerce teams monitor customer ratings to refine product quality and marketing decisions.
- Data analysts build dashboards using structured outputs for pricing, availability, and ratings.
- Product managers explore market gaps by comparing technical attributes across models.
Does this scraper work with any Cdiscount product link? Yes, as long as the link is a full product page URL and follows the standard structure. If the page layout matches Cdiscount’s product template, extraction works smoothly.
Can it handle large batches of URLs? It can. Grouping URLs into batches of a few hundred usually delivers the best throughput while keeping retry counts low.
What if a product has missing fields? The scraper returns whatever data is available. Some fields such as GTIN or extended specs may be empty depending on the listing.
Can I use proxies with this scraper? Yes. Proxies help maintain uninterrupted access, especially for high-volume scraping.
Primary Metric: On average, the scraper processes individual product pages in under two seconds, even when pages contain heavy images or long technical sections.
Reliability Metric: Typical success rates hover above 97 percent with retries enabled, even across large batch operations.
Efficiency Metric: Memory usage remains low thanks to streaming extraction logic, allowing hundreds of pages per run on modest hardware.
Quality Metric: Structured field completeness routinely reaches above 90 percent for listings containing detailed technical sections, ensuring dependable analytical output.
