Skip to content

Conversation

@musically-ut
Copy link

Though the README.md hinted that lxml will be used if available, the choice of parsers was forced to be only html5lib in the code.

Also, have added checks to parse only the <head> tag to improve performance on particularly large HTML files.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant