05 Complexquery AdvancedRAG
Advanced RAG Testbed for ORAN chatbot
Load Documents
Follow step 1 defined here to upload the PDFs to the Milvus server.
Generate and Store Embeddings
a) Generate Embeddings
Document embeddings are created by vectorizing the document text; the resulting vectors capture the semantic meaning of the text.
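The generate-and-store step can be sketched roughly as follows. This is a minimal illustration, not the notebook's code: `embed` is a toy bag-of-words vectorizer standing in for a real embedding model (it captures word overlap only, not semantics), the in-memory `store` list stands in for Milvus, and the chunk texts are placeholders rather than content from the uploaded PDFs.

```python
import math
import re
from collections import Counter

def embed(text):
    """Toy embedding (assumption): bag-of-words counts, not a neural model."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))

def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

# Illustrative placeholder chunks, not text from the uploaded PDFs.
chunks = [
    "The O-RAN architecture splits the base station into O-RU, O-DU and O-CU.",
    "The near-real-time RIC hosts xApps that control the RAN.",
]

# "Store": keep each chunk next to its vector. A vector database such as
# Milvus plays this role in the real pipeline.
store = [(chunk, embed(chunk)) for chunk in chunks]

# Score every stored chunk against a query vector.
query_vec = embed("xApps RIC")
scores = [cosine(query_vec, vec) for _, vec in store]
```

With a real embedding model, only `embed` would change; the store-then-compare pattern stays the same.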
Let's try base RAG
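As a rough sketch of what base RAG does: embed the query, retrieve the most similar chunks, and place them in the prompt sent to the LLM. Everything below is an assumption made for illustration: `embed` is a toy bag-of-words stand-in for the embedding model, `retrieve` and `build_prompt` are hypothetical helper names, the chunks are placeholder text, and the LLM call itself is left out.

```python
import math
import re
from collections import Counter

def embed(text):
    """Toy embedding (assumption): bag-of-words counts, not a neural model."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))

def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def retrieve(query, chunks, k=2):
    """Return the k chunks most similar to the query (hypothetical helper)."""
    q = embed(query)
    ranked = sorted(chunks, key=lambda c: cosine(q, embed(c)), reverse=True)
    return ranked[:k]

def build_prompt(query, context):
    """Assemble a grounded prompt from the retrieved chunks."""
    joined = "\n".join(f"- {c}" for c in context)
    return (
        "Answer using only the context below.\n\n"
        f"Context:\n{joined}\n\nQuestion: {query}"
    )

# Illustrative placeholder chunks, not text from the uploaded PDFs.
chunks = [
    "The O-RAN architecture splits the base station into O-RU, O-DU and O-CU.",
    "The near-real-time RIC hosts xApps that control the RAN.",
    "The non-real-time RIC hosts rApps for policy and optimization.",
]

query = "What does the near-real-time RIC host?"
context = retrieve(query, chunks, k=1)
prompt = build_prompt(query, context)
# The prompt would now be sent to the LLM; that call is omitted here.
```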
Let's try multi-query RAG
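Multi-query RAG rephrases the user question into several variants, retrieves for each variant, and merges the results so that a chunk missed by one phrasing can still be found by another. The sketch below assumes the variants are already available as a list (in practice an LLM would generate them), uses a toy bag-of-words `embed` in place of a real embedding model, and merges by simple first-seen deduplication; `multi_query_retrieve` is a hypothetical name, and the notebook may merge results differently.

```python
import math
import re
from collections import Counter

def embed(text):
    """Toy embedding (assumption): bag-of-words counts, not a neural model."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))

def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def retrieve(query, chunks, k=1):
    """Return the k chunks most similar to the query."""
    q = embed(query)
    return sorted(chunks, key=lambda c: cosine(q, embed(c)), reverse=True)[:k]

def multi_query_retrieve(queries, chunks, k=1):
    """Retrieve for every query variant and merge, dropping duplicates."""
    merged = []
    for q in queries:
        for chunk in retrieve(q, chunks, k):
            if chunk not in merged:
                merged.append(chunk)
    return merged

# Illustrative placeholder chunks, not text from the uploaded PDFs.
chunks = [
    "The O-RAN architecture splits the base station into O-RU, O-DU and O-CU.",
    "The near-real-time RIC hosts xApps that control the RAN.",
    "The non-real-time RIC hosts rApps for policy and optimization.",
]

# Hand-written variants standing in for LLM-generated rephrasings.
variants = ["Which component hosts xApps?", "Which component hosts rApps?"]
context = multi_query_retrieve(variants, chunks, k=1)
```

Here each variant surfaces a different chunk, so the merged context covers both RIC types even though neither query alone would retrieve both at `k=1`.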
Let's try HyDE (hypothetical LLM answer) based RAG
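In HyDE, the LLM first writes a hypothetical answer to the question, and that answer, rather than the question itself, is embedded and used for retrieval, since an answer-shaped text tends to sit closer to the relevant passages. A minimal sketch under stated assumptions: `generate` is any callable that returns a hypothetical answer (a canned lambda here, an LLM call in practice), `embed` is a toy bag-of-words stand-in for the embedding model, `hyde_retrieve` is a hypothetical name, and the chunks are placeholder text.

```python
import math
import re
from collections import Counter

def embed(text):
    """Toy embedding (assumption): bag-of-words counts, not a neural model."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))

def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def hyde_retrieve(query, chunks, generate, k=1):
    """Embed a hypothetical answer instead of the query, then retrieve."""
    hypothetical = generate(query)   # an LLM call in a real pipeline
    h = embed(hypothetical)
    return sorted(chunks, key=lambda c: cosine(h, embed(c)), reverse=True)[:k]

# Illustrative placeholder chunks, not text from the uploaded PDFs.
chunks = [
    "The O-RAN architecture splits the base station into O-RU, O-DU and O-CU.",
    "The near-real-time RIC hosts xApps that control the RAN.",
    "The non-real-time RIC hosts rApps for policy and optimization.",
]

# Canned hypothetical answer standing in for LLM output.
fake_llm = lambda q: "xApps run on the near-real-time RIC, which controls the RAN."

query = "Where do xApps run?"
context = hyde_retrieve(query, chunks, fake_llm, k=1)
```

The hypothetical answer does not need to be factually correct; it only needs to resemble the passages that would answer the question.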