
Ultimate access to all questions.
A Generative AI Engineer is developing a RAG application that retrieves context from source documents in HTML format. They want to implement a solution with the fewest lines of code.
Which Python package should be used to extract the text from the HTML source documents?
A
pytesseract
B
numpy
C
pypdf2
D
beautifulsoup