Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Enhance document content intelligently #20

Open
DaveParr opened this issue Jun 4, 2024 · 0 comments
Open

Enhance document content intelligently #20

DaveParr opened this issue Jun 4, 2024 · 0 comments
Labels
enhancement New feature or request

Comments

@DaveParr
Copy link
Owner

DaveParr commented Jun 4, 2024

It might be valuable to call an llm during read time to enhance document data. 2 operations might be useful

  • At document creation, before vectorstor embedding, an llm could be tasked to catagorise the content based on user defined tags. e.g. if the repo is to do with data science, or if it is likely a repo from a hackathon and not a module to be used in other projects.
  • To use a form of Hypothetical document embedding at ETL time, e.g. supply the repo readme.md content to a llm with the prompt "based on this readme.md what is the intentended use case for this repository, and what problems does it solve/ tasks does it help with?"
@DaveParr DaveParr added the enhancement New feature or request label Jun 4, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant