Skip to content

Releases: fcavallarin/htcrawl

v1.3.0: Release version 1.3.0 (#20)

21 Oct 16:51
471c952

Choose a tag to compare

  • Upgraded libraries to the latest version
  • Changed Xpath selector for the new Puppetter
  • Improved XHR/Fetch request handling

v1.2.1

11 Dec 08:01
f620f25

Choose a tag to compare

The body of XHR/Fetch requests is fetched only if <= 5kb

v1.2.0: Added iframe and postMessage support

10 Dec 10:02
f41a6f2

Choose a tag to compare

Iframe Integration

Iframes are now fully integrated and treated as part of the main page, facilitating easier crawling.
Key improvements include:

  • New CSS Selector: A new CSS selector has been introduced for selecting elements within iframes.
  • Cross-Origin Iframe Crawling: It's now feasible to crawl iframes that are not of the same origin.

postMessage Handling

A new feature to intercept and manipulate 'postMessage' calls has been implemented.

v1.1.1

16 Nov 11:57
49e8af0

Choose a tag to compare

  • Messages sent to the chrome-extension now use a message-queue
  • getElementSelector handles pages with non-unique IDs

v1.1.0

12 Nov 13:16
f28b9dc

Choose a tag to compare

  • Added UI as Chrome extension
  • Added unit tests
  • Added cli tools
  • Added experimental heuristic content dedpulicator
  • Bug fixes

1.0.7: Merge pull request #6 from fcavallarin/developer

13 Feb 10:30
1d65b18

Choose a tag to compare

1.0.6

29 Jan 15:01
21bc5e0

Choose a tag to compare

Merge pull request #5 from fcavallarin/developer

Updated puppeteer and minor bugfix

1.0.4

06 Dec 12:06
0551372

Choose a tag to compare

Merge pull request #3 from fcavallarin/developer

Version 1.0.4

1.0.3

06 Dec 10:37
a5a7e04

Choose a tag to compare

Merge pull request #2 from fcavallarin/developer

Improved angular coverage

1.0.2

04 Dec 21:39
9ea03f2

Choose a tag to compare

Merge pull request #1 from fcavallarin/developer

Moving some logic from browser context to node