html-to-text
When ingesting HTML documents for later retrieval, we are often interested only in the actual content of the webpage rather than its markup.
@mozilla/readability
When ingesting HTML documents for later retrieval, we are often interested only in the actual content of the webpage rather than its markup.
OpenAI functions metadata tagger
It can often be useful to tag ingested documents with structured metadata, such as the title, tone, or length of a document, to allow for more targeted similarity search later. However, for large numbers of documents, performing this labelling process manually can be tedious.