Educational Python tool to crawl a website (BFS, depth-controlled) and reconstruct everything a browser can retrieve — HTML, CSS, JS, images, fonts, PDFs — with offline-safe path rewriting and same-origin enforcement. Includes srcset handling, query-hash handling and content-type-aware saving. Ideal for learning and client-side security research.
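As a rough illustration of two of the features named above, same-origin enforcement and srcset handling, here is a minimal sketch using only the standard library. The helper names `origin`, `same_origin`, and `parse_srcset` are hypothetical and not taken from the repository, and the srcset splitting is deliberately naive: it assumes candidate URLs contain no commas.

```python
from urllib.parse import urljoin, urlsplit

# Default ports, so that https://host and https://host:443 compare equal.
DEFAULT_PORTS = {"http": 80, "https": 443}


def origin(url):
    """Return the (scheme, host, port) triple that defines a URL's origin."""
    parts = urlsplit(url)
    scheme = parts.scheme.lower()
    host = (parts.hostname or "").lower()
    port = parts.port or DEFAULT_PORTS.get(scheme)
    return (scheme, host, port)


def same_origin(a, b):
    """True when both URLs share scheme, host, and port."""
    return origin(a) == origin(b)


def parse_srcset(value, base_url):
    """Return absolute URLs for each candidate in a srcset attribute value.

    Naive assumption: candidates are separated by commas and the URLs
    themselves contain no commas.
    """
    urls = []
    for candidate in value.split(","):
        parts = candidate.strip().split()
        if parts:
            # The first token is the URL; the rest is a width or density descriptor.
            urls.append(urljoin(base_url, parts[0]))
    return urls
```

A crawler built this way would typically resolve every discovered URL against the page URL first, then download only those for which `same_origin(page_url, resource_url)` holds.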
This project aims to evaluate the performance of state-of-the-art techniques for web crawling and content extraction from web pages, in the context of automating the evaluation of transparency portals in the state of Paraíba.
A web crawler that crawls a webpage in BFS order and returns the depth from the origin, the most frequent word, and the number of valid external links on the page.
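A depth-limited BFS crawl of this kind can be sketched as follows. This is an illustration under stated assumptions, not the repository's code: `bfs_crawl` and its `fetch` callback are hypothetical names, `fetch(url)` is assumed to return a `(text, links)` pair or `None`, and a "valid external link" is assumed to mean an http(s) URL on a different host.

```python
import re
from collections import Counter, deque
from urllib.parse import urlsplit


def bfs_crawl(start, fetch, max_depth):
    """Crawl breadth-first from `start`, following same-host links only.

    `fetch(url)` is a caller-supplied function (an assumption of this
    sketch) returning (page_text, list_of_absolute_links) or None.
    Returns a dict mapping each visited URL to its depth, most frequent
    word, and count of external links.
    """
    start_host = urlsplit(start).hostname
    seen = {start}
    queue = deque([(start, 0)])  # FIFO queue gives breadth-first order
    report = {}

    while queue:
        url, depth = queue.popleft()
        page = fetch(url)
        if page is None:
            continue
        text, links = page

        # Most frequent word, counting lowercase alphabetic runs.
        words = Counter(re.findall(r"[a-z]+", text.lower()))
        top_word = words.most_common(1)[0][0] if words else None

        # Assumed validity rule: http(s) scheme and a different host.
        external = [
            link for link in links
            if urlsplit(link).scheme in ("http", "https")
            and urlsplit(link).hostname != start_host
        ]

        report[url] = {
            "depth": depth,
            "top_word": top_word,
            "external_links": len(external),
        }

        # Enqueue unseen same-host links until the depth limit is reached.
        if depth < max_depth:
            for link in links:
                if link not in seen and urlsplit(link).hostname == start_host:
                    seen.add(link)
                    queue.append((link, depth + 1))

    return report
```

Passing `fetch` in as a parameter keeps the traversal logic separate from networking, so the same function can be exercised against an in-memory dictionary of pages or wired to a real HTTP client.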