YonathanGuez/crawler_python3

Beginning Crawling / Scraping with Python 3

This project is dedicated to getting information from websites and testing libraries such as BeautifulSoup, ChromeDriver, and requests.

Install

$ pip install -r requirements.txt

Check the Hierarchy of All Heading Tags

This script checks all Hn heading tags (h1 to h6) on a page and counts the number of characters in each one.

It applies a technical SEO practice: checking the hierarchy of the Hn headings.
The hierarchy matters for a site's ranking: if headings are used badly, the site can rank poorly.

This script uses only two libraries: requests and BeautifulSoup.
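To make the idea concrete, here is a minimal sketch of a heading check with BeautifulSoup. It is not the code of check_heading_hierarchy.py; the function names `extract_headings` and `find_hierarchy_problems` are hypothetical, and the rule "a heading must not skip a level" is an assumption about what a hierarchy check could look for. The HTML is passed in as a string; in practice it could come from `requests.get(url).text`.

```python
from bs4 import BeautifulSoup

HEADING_TAGS = ["h1", "h2", "h3", "h4", "h5", "h6"]


def extract_headings(html):
    """Return (level, text) pairs for every h1..h6 tag, in document order."""
    soup = BeautifulSoup(html, "html.parser")
    headings = []
    for tag in soup.find_all(HEADING_TAGS):
        level = int(tag.name[1])  # "h3" -> 3
        headings.append((level, tag.get_text(strip=True)))
    return headings


def find_hierarchy_problems(headings):
    """Report headings that skip a level (assumed rule, e.g. h2 then h4)."""
    problems = []
    previous_level = 0  # 0 means no heading has been seen yet
    for level, text in headings:
        if previous_level == 0 and level != 1:
            problems.append("first heading is h{} ('{}'), expected h1".format(level, text))
        elif previous_level > 0 and level > previous_level + 1:
            problems.append(
                "h{} ('{}') comes right after h{}: level skipped".format(
                    level, text, previous_level
                )
            )
        previous_level = level
    return problems


# Hypothetical sample page used only for illustration.
sample_html = "<h1>Title</h1><h2>Intro</h2><h4>Detail</h4>"
found = extract_headings(sample_html)
for level, text in found:
    print("h{}: {} ({} characters)".format(level, text, len(text)))
for problem in find_hierarchy_problems(found):
    print("Problem:", problem)
```

Going back up the hierarchy (for example h3 followed by h2) is treated as valid here; only jumps downward by more than one level are flagged.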

Configuration:

Python 3.5, pip 20

Run the code :

python check_heading_hierarchy.py --url <https://URL>
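A command-line flag such as `--url` is commonly read with the standard `argparse` module. The sketch below assumes that approach; the README does not say how the script parses its arguments, and the helper name `parse_args` is hypothetical.

```python
import argparse


def parse_args(argv=None):
    """Parse the command line; argv=None makes argparse read sys.argv."""
    parser = argparse.ArgumentParser(
        description="Check the heading hierarchy of a web page."
    )
    parser.add_argument(
        "--url",
        required=True,
        help="full URL of the page to check, including the https:// prefix",
    )
    return parser.parse_args(argv)


# Passing an explicit list simulates: python script.py --url https://example.com
args = parse_args(["--url", "https://example.com"])
print("Checking headings of", args.url)
```

Marking the flag as required means the script stops with a usage message when the URL is missing, instead of failing later during the request.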

Example of how to use it:

Click here :

Help_archive:

Chrome

version : 80.0.3XXXX

Configuration

Python 3.5, pip 20, ChromeDriver

Examples and tests that helped me build the project:

	 python get_all_anchors.py
	 python check_heading_hierarchy.py
	 python get_load_time_chromedrive.py
	 python get_meta_title.py
	 python get_requests_status_headers.py
	 python scrapjs_chrome_headless.py
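As an illustration of the kind of task these small scripts cover, the sketch below reads a page's title and meta description with BeautifulSoup. It is only a guess based on the file name get_meta_title.py, not the contents of that script; the function name `get_title_and_description` and the sample HTML are hypothetical.

```python
from bs4 import BeautifulSoup


def get_title_and_description(html):
    """Return (title, meta description); either value is None when absent."""
    soup = BeautifulSoup(html, "html.parser")
    # soup.title is the first <title> tag, or None if the page has none.
    title = soup.title.get_text(strip=True) if soup.title else None
    meta = soup.find("meta", attrs={"name": "description"})
    description = meta.get("content") if meta else None
    return title, description


# Hypothetical sample page used only for illustration.
sample_html = (
    "<html><head><title> Home </title>"
    '<meta name="description" content="A demo page">'
    "</head><body></body></html>"
)
title, description = get_title_and_description(sample_html)
print("Title:", title)
print("Description:", description)
```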
