HtmlLinkExtractor#

Beta

This feature is in beta. It is actively being worked on, so the API may change.

Extract hyperlinks from HTML content.

Expects the input to be an HTML string or a BeautifulSoup object.

Example:

extractor = HtmlLinkExtractor()
results = extractor.extract_one(HtmlInput(html, url))
Parameters:
  • kind (str) –

  • drop_fragments (bool) –