GenAI Part 2


To understand any Github RepoLearn any GitHub repo in 59 seconds. Onboard AI learns any GitHub repo in minutes and lets you chat with it to locate functionality, understand different parts, and generate new code

https://app.getonboardai.com/

=========================================================

Langchain

https://github.com/kyrolabs/awesome-langchain

=========================================================

Model file 

extension: HF, GPTQ, GGML, BIN and GGUF.

Each model needs 4 files (with example content)

File 1 - The model's GGUF file

File 2 - The model's .yaml file

backend: llama

context_size: 2000

name: lunademo

parameters:

  model: luna-ai-llama2-uncensored.Q4_K_M.gguf

  temperature: 0.2

  top_k: 40

  top_p: 0.65

roles:

  assistant: 'ASSISTANT:'

  system: 'SYSTEM:'

  user: 'USER:'

roles:

  assistant: 'ASSISTANT:'

  system: 'SYSTEM:'

  user: 'USER:'

File 3 - The Chat API .tmpl file

{{.Input}}

ASSISTANT:

Complete the following sentence: {{.Input}}

File 4 - The Completion API .tmpl file

=========================================================

Local Deployment of GenAI

1. https://github.com/cocktailpeanut/dalai

2. GPT 4 all https://gpt4all.io/index.html https://github.com/nomic-ai/gpt4all

3. vLLM library https://vllm.readthedocs.io/en/latest/#

4. API store https://gorilla.cs.berkeley.edu/ and https://github.com/ShishirPatil/gorilla

5. https://github.com/imartinez/privateGPT and https://docs.privategpt.dev/ Based on PrivateGPT https://github.com/marella/chatdocs

6. https://github.com/Lightning-AI/lit-gpt based on nanoGPT

7. https://github.com/Vision-CAIR/MiniGPT-4/tree/main and https://minigpt-4.github.io/ https://huggingface.co/spaces/Vision-CAIR/minigpt4  https://www.youtube.com/watch?v=__tftoxpBAw

8. https://collabnix.com/running-ollama-2-on-nvidia-jetson-nano-with-gpu-using-docker/

9. Langchain

https://python.langchain.com/docs/guides/local_llms

10. Casandra DB

https://cassio.org/frameworks/langchain/qa-basic/

https://colab.research.google.com/github/CassioML/cassio-website/blob/main/docs/frameworks/langchain/.colab/colab_qa-basic.ipynb

11. gtr-t5-large model is around 640 MB

https://til.simonwillison.net/python/gtr-t5-large

=========================================================

Local Document search

1. https://github.com/h2oai/h2ogpt

2. https://github.com/imartinez/privateGPT and https://docs.privategpt.dev/  OR https://github.com/SamurAIGPT/EmbedAI

3. https://github.com/PromtEngineer/localGPT

4. LocalAI

https://localai.io/

https://github.com/mudler/LocalAI

https://github.com/go-skynet/helm-charts/tree/main/charts/local-ai and https://localai.io/basics/getting_started/index.html#run-localai-in-kubernetes

https://localai.io/basics/build/index.html

integration with Logseq https://github.com/briansunter/logseq-plugin-gpt3-openai

How Tos https://localai.io/howtos/

https://localai.io/howtos/easy-request-openai/

Access

https://localai.io/howtos/easy-model-import-downloaded/

https://localai.io/howtos/easy-request-curl/

All Git Repo https://github.com/lunamidori5

5. https://mudler.pm/posts/localai-question-answering/

6. GPT3 and datasette

https://simonwillison.net/2023/Jan/13/semantic-search-answers/

0 comments:

Post a Comment