klotz: python* + document*


  1. MarkItDown is a utility for converting various files to Markdown, including PDF, PowerPoint, Word, Excel, Images, Audio, HTML, text-based formats, and ZIP files.

  2. An open-source project offering a functional RAG UI for document QA, suitable for both end-users and developers. It supports various LLM providers, is customizable, and offers multi-modal QA, citations, and complex reasoning methods.

    2024-10-13 by klotz
  3. pip install 'ragna[all]'  # Install ragna with all extensions
     ragna config              # Initialize configuration
     ragna ui                  # Launch the web app

    2023-11-02 by klotz
  4. 2023-06-03 by klotz
  5. A simple Python3 tool to detect similarities between files within a repository. Document similarity code adapted from Jonathan Mugan's tutorial: https://www.oreilly.com/learning/how-do-i-compare-document-similarity-using-python

    2020-03-11 by klotz
  6. Tokenizing and stemming each synopsis; transforming the corpus into vector space using tf-idf; calculating cosine distance between each document as a measure of similarity; clustering the documents using the k-means algorithm; using multidimensional scaling to reduce dimensionality within the corpus; plotting the clustering output using matplotlib and mpld3; conducting a hierarchical clustering on the corpus using Ward clustering; plotting a Ward dendrogram; topic modeling using Latent Dirichlet Allocation (LDA).

    2018-08-16 by klotz
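
Several of the bookmarks above rely on tf-idf weighting followed by cosine similarity between documents. The following is a minimal standard-library sketch of that idea, not code from any of the bookmarked projects or tutorials. The function names (`tokenize`, `tfidf_vectors`, `cosine_similarity`), the smoothed idf formula, and the three sample sentences are assumptions chosen for illustration, and stemming is left out to keep the example short.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and keep runs of letters (no stemming in this sketch)."""
    return re.findall(r"[a-z]+", text.lower())


def tfidf_vectors(docs):
    """Return one sparse {term: weight} dict per document."""
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(tokenized)
    # Document frequency: in how many documents each term appears.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    vectors = []
    for tokens in tokenized:
        tf = Counter(tokens)
        # Assumed smoothed idf, so a term found in every document keeps a nonzero weight.
        vectors.append({
            term: count * (math.log((1 + n_docs) / (1 + df[term])) + 1)
            for term, count in tf.items()
        })
    return vectors


def cosine_similarity(a, b):
    """Cosine of the angle between two sparse vectors; 0.0 if either is empty."""
    dot = sum(w * b.get(term, 0.0) for term, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


# Hypothetical sample documents.
docs = [
    "the cat sat on the mat",
    "the cat lay on the rug",
    "stock prices fell sharply today",
]
vecs = tfidf_vectors(docs)
sim_01 = cosine_similarity(vecs[0], vecs[1])  # the two cat sentences share terms
sim_02 = cosine_similarity(vecs[0], vecs[2])  # no shared terms
print(sim_01 > sim_02)
```

Cosine distance is then `1 - cosine_similarity(a, b)`, and a matrix of those distances is the usual input to the clustering and multidimensional scaling steps named in item 6.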
