Conversation
There was a problem hiding this comment.
Pull Request Overview
This PR introduces a web scraping script that extracts product information from an e-commerce website. The script asynchronously scrapes product listings and their detailed pages to collect titles, prices, links, and barcodes.
Key Changes
- Creates an async web scraper using aiohttp and BeautifulSoup
- Implements product data extraction with barcode lookup from individual product pages
- Outputs scraped data to a JSON file
Tip: Customize your code reviews with copilot-instructions.md. Create the file or learn how to get started.
| with open("products.json", "w", encoding="utf-8") as f: | ||
| json.dump(products, f, ensure_ascii=False, indent=4) | ||
|
|
||
| print("✅ products.json дайын!") |
There was a problem hiding this comment.
[nitpick] The success message is in a non-English language while the rest of the code uses English. Consider using consistent language throughout the codebase.
| print("✅ products.json дайын!") | |
| print("✅ products.json is ready!") |
| "price": price, | ||
| "link": link, | ||
| "barcode": barcode | ||
| }) |
There was a problem hiding this comment.
No error handling for HTTP requests. Consider adding try-except blocks to handle potential network errors, timeouts, or HTTP error status codes.
| }) | |
| try: | |
| async with session.get(BASE_URL, headers=HEADERS) as response: | |
| if response.status != 200: | |
| print(f"Error: Failed to fetch main page. Status code: {response.status}") | |
| return | |
| html = await response.text() | |
| soup = BS(html, "html.parser") | |
| items = soup.find_all("div", {"class": "bx_catalog_item_container"}) | |
| for item in items: | |
| title_tag = item.find("div", {"class": "bx_catalog_item_title"}) | |
| title = title_tag.text.strip() if title_tag else "No title" | |
| link_tag = title_tag.find("a") if title_tag else None | |
| link = "https://ovdi.ru" + link_tag.get("href") if link_tag else None | |
| price_tag = item.find("div", {"class": "bx_catalog_item_price"}) | |
| price = price_tag.text.strip() if price_tag else "No price" | |
| barcode = "Штрихкод табылмады" | |
| if link: | |
| try: | |
| async with session.get(link, headers=HEADERS) as product_response: | |
| if product_response.status != 200: | |
| print(f"Warning: Failed to fetch product page {link}. Status code: {product_response.status}") | |
| else: | |
| product_html = await product_response.text() | |
| product_soup = BS(product_html, "html.parser") | |
| text = product_soup.get_text(" ", strip=True) | |
| match = re.search(r"Штрихкод:\s*(\d+)", text) | |
| if match: | |
| barcode = match.group(1) | |
| except (ClientError, TimeoutError) as e: | |
| print(f"Warning: Exception occurred while fetching product page {link}: {e}") | |
| products.append({ | |
| "title": title, | |
| "price": price, | |
| "link": link, | |
| "barcode": barcode | |
| }) | |
| except (ClientError, TimeoutError) as e: | |
| print(f"Error: Exception occurred while fetching main page: {e}") | |
| return |
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
import asyncio
import aiohttp
from bs4 import BeautifulSoup as BS
from fake_useragent import UserAgent
import json
import re
# Listing page to scrape (the "malysham" / for-babies category of ovdi.ru).
BASE_URL = "https://ovdi.ru/shop/malysham/"
# Request headers with a randomly chosen browser User-Agent (fresh each run).
HEADERS = {"User-Agent": UserAgent().random}
async def main():
    """Scrape product listings from ovdi.ru and write them to products.json.

    For each product on the listing page, collect its title, price and link,
    then fetch the individual product page to look up its barcode. Network
    failures on a product page are reported and skipped; a failure on the
    main listing page aborts the run with a message.

    Side effects: writes products.json in the current directory and prints
    progress/error messages. Returns None.
    """
    products = []
    async with aiohttp.ClientSession() as session:
        # Fetch the listing page; bail out entirely if it cannot be loaded.
        try:
            async with session.get(BASE_URL, headers=HEADERS) as response:
                if response.status != 200:
                    print(f"Error: Failed to fetch main page. Status code: {response.status}")
                    return
                html = await response.text()
        except (aiohttp.ClientError, asyncio.TimeoutError) as e:
            print(f"Error: Exception occurred while fetching main page: {e}")
            return

        soup = BS(html, "html.parser")
        items = soup.find_all("div", {"class": "bx_catalog_item_container"})
        for item in items:
            title_tag = item.find("div", {"class": "bx_catalog_item_title"})
            title = title_tag.text.strip() if title_tag else "No title"
            link_tag = title_tag.find("a") if title_tag else None
            link = "https://ovdi.ru" + link_tag.get("href") if link_tag else None
            price_tag = item.find("div", {"class": "bx_catalog_item_price"})
            price = price_tag.text.strip() if price_tag else "No price"

            # Default used when the product page is unreachable or has no
            # "Штрихкод: <digits>" marker.
            barcode = "Barcode not found"
            if link:
                # Best-effort barcode lookup: a failing product page only
                # produces a warning, not an aborted run.
                try:
                    async with session.get(link, headers=HEADERS) as product_response:
                        if product_response.status != 200:
                            print(f"Warning: Failed to fetch product page {link}. Status code: {product_response.status}")
                        else:
                            product_html = await product_response.text()
                            product_soup = BS(product_html, "html.parser")
                            text = product_soup.get_text(" ", strip=True)
                            # The site labels barcodes in Russian ("Штрихкод:"),
                            # so the pattern must stay in Russian.
                            match = re.search(r"Штрихкод:\s*(\d+)", text)
                            if match:
                                barcode = match.group(1)
                except (aiohttp.ClientError, asyncio.TimeoutError) as e:
                    print(f"Warning: Exception occurred while fetching product page {link}: {e}")

            products.append({
                "title": title,
                "price": price,
                "link": link,
                "barcode": barcode,
            })

    # Persist results; ensure_ascii=False keeps non-Latin text readable.
    with open("products.json", "w", encoding="utf-8") as f:
        json.dump(products, f, ensure_ascii=False, indent=4)

    print("✅ products.json is ready!")


if __name__ == "__main__":
    asyncio.run(main())