<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	>
<channel>
	<title>
	Comments on: Converting Salesforce Data into Embeddings with OpenAI and AWS Lambda	</title>
	<atom:link href="https://www.jitendrazaa.com/blog/salesforce/converting-salesforce-data-into-embeddings-with-openai-and-aws-lambda/feed/" rel="self" type="application/rss+xml" />
	<link>https://www.jitendrazaa.com/blog/salesforce/converting-salesforce-data-into-embeddings-with-openai-and-aws-lambda/</link>
	<description>AI, Salesforce, ServiceNow &#38; Enterprise Tech Guides</description>
	<lastBuildDate>Fri, 15 Dec 2023 13:14:53 +0000</lastBuildDate>
	<sy:updatePeriod>
	hourly	</sy:updatePeriod>
	<sy:updateFrequency>
	1	</sy:updateFrequency>
	<generator>https://wordpress.org/?v=6.9.4</generator>
	<item>
		<title>
		By: Esteve Graells		</title>
		<link>https://www.jitendrazaa.com/blog/salesforce/converting-salesforce-data-into-embeddings-with-openai-and-aws-lambda/#comment-118013</link>

		<dc:creator><![CDATA[Esteve Graells]]></dc:creator>
		<pubDate>Fri, 15 Dec 2023 13:14:53 +0000</pubDate>
		<guid isPermaLink="false">https://www.jitendrazaa.com/blog/?p=11191#comment-118013</guid>

					<description><![CDATA[In reply to &lt;a href=&quot;https://www.jitendrazaa.com/blog/salesforce/converting-salesforce-data-into-embeddings-with-openai-and-aws-lambda/#comment-118012&quot;&gt;Jitendra&lt;/a&gt;.

Hi Jitendra, I went through all 3 posts, but I thought posting on the first one would also help with the following 2. If you prefer, I can post my comment on the third one.]]></description>
			<content:encoded><![CDATA[<p>In reply to <a href="https://www.jitendrazaa.com/blog/salesforce/converting-salesforce-data-into-embeddings-with-openai-and-aws-lambda/#comment-118012">Jitendra</a>.</p>
<p>Hi Jitendra, I went through all 3 posts, but I thought posting on the first one would also help with the following 2. If you prefer, I can post my comment on the third one.</p>
]]></content:encoded>
		
			</item>
		<item>
		<title>
		By: Jitendra		</title>
		<link>https://www.jitendrazaa.com/blog/salesforce/converting-salesforce-data-into-embeddings-with-openai-and-aws-lambda/#comment-118012</link>

		<dc:creator><![CDATA[Jitendra]]></dc:creator>
		<pubDate>Fri, 15 Dec 2023 12:06:05 +0000</pubDate>
		<guid isPermaLink="false">https://www.jitendrazaa.com/blog/?p=11191#comment-118012</guid>

					<description><![CDATA[In reply to &lt;a href=&quot;https://www.jitendrazaa.com/blog/salesforce/converting-salesforce-data-into-embeddings-with-openai-and-aws-lambda/#comment-118011&quot;&gt;Esteve Graells&lt;/a&gt;.

That&#039;s a great explanation, Esteve, thank you. In this blog post there is no RAG; I am just showcasing how to create embeddings. RAG happens in the part 2 blog post, where I have used LangChain to enrich the prompt.]]></description>
			<content:encoded><![CDATA[<p>In reply to <a href="https://www.jitendrazaa.com/blog/salesforce/converting-salesforce-data-into-embeddings-with-openai-and-aws-lambda/#comment-118011">Esteve Graells</a>.</p>
<p>That&#8217;s a great explanation, Esteve, thank you. In this blog post there is no RAG; I am just showcasing how to create embeddings. RAG happens in the part 2 blog post, where I have used LangChain to enrich the prompt.</p>
]]></content:encoded>
		
			</item>
		<item>
		<title>
		By: Esteve Graells		</title>
		<link>https://www.jitendrazaa.com/blog/salesforce/converting-salesforce-data-into-embeddings-with-openai-and-aws-lambda/#comment-118011</link>

		<dc:creator><![CDATA[Esteve Graells]]></dc:creator>
		<pubDate>Fri, 15 Dec 2023 09:50:52 +0000</pubDate>
		<guid isPermaLink="false">https://www.jitendrazaa.com/blog/?p=11191#comment-118011</guid>

					<description><![CDATA[Hello Jitendra,

I would like to begin by commending your excellent example of a RAG implementation in the context of Salesforce. Thanks for sharing it with the community; I very much appreciate it.

However, I&#039;d like to offer some feedback regarding certain aspects of your explanation:

Base concepts: the idea of training or improving the model with RAG does not apply here. The LLM remains immutable; it can only be enriched via fine-tuning, which is hugely expensive and needs dedicated time and effort to create datasets that are good but not contaminated. RAG is a technique to enrich a prompt, not to improve or extend a model: it does some pre-work before sending the request to the model&#039;s completion API.

Input to the LLM: It&#039;s important to clarify that a Large Language Model (LLM), whether behind an OpenAI-compatible API or not, doesn&#039;t receive a pre-existing vector or embedding as input. Instead, it processes a combination of elements, such as a custom context, a system prompt, the user query and, optionally, an assistant prompt, all in the form of text, never a vector or any other form.

Purpose of embeddings and the vector database: Creating embeddings and setting up a vector database serve to enrich the knowledge base, never to train the LLM, which can&#039;t be trained but only extended at huge cost (not applicable here). This knowledge base is built by capturing information before sending the context to the LLM, using what is called a RAG pipeline: a vector database fed by a loader (which splits and vectorizes the splits) and retrievers that try to find the best semantic matches for the user query.

Semantic search: The semantic search runs the user&#039;s query against the vector database, and the results depend on the search algorithm applied. These results form the context that is subsequently sent to the LLM along with the user&#039;s request and the other parts of the prompt. As you sometimes experienced, the retriever can fail to find the right semantic answers when the retrieval algorithm is not working well; I think you could try Contextual Compression or, my favourite, the multi-query retriever.

Role of embeddings: While creating embeddings does populate the vector database, it is crucial to create the right embeddings, and you didn&#039;t mention how to split the knowledge that you want to use for augmentation. Splitting the wrong way, without overlap, and not using techniques like MMR will probably produce poor results.

Deployment considerations: You mentioned encountering challenges with using Lambdas for deployment. It&#039;s worth noting that there are alternative deployment options much better than Lambdas. Solutions such as Bedrock, which facilitates the use of other LLMs that may be better tailored to your specific problem, are among my favorites. Additionally, depending on your requirements and constraints, you may consider employing larger models like Falcon, or chat-tuned ones that greatly reduce the sizing requirements.

I trust that these observations will be beneficial not only to you but also to your readers, including individuals like myself who deeply appreciate the work you have undertaken. Your dedication to the Salesforce arena is admirable, and I look forward to seeing more of your contributions in the future.
Please don&#039;t hesitate to reach out to me if you want to clarify any of these points; I would be honored.]]></description>
			<content:encoded><![CDATA[<p>Hello Jitendra,</p>
<p>I would like to begin by commending your excellent example of a RAG implementation in the context of Salesforce. Thanks for sharing it with the community; I very much appreciate it.</p>
<p>However, I&#8217;d like to offer some feedback regarding certain aspects of your explanation:</p>
<p>Base concepts: the idea of training or improving the model with RAG does not apply here. The LLM remains immutable; it can only be enriched via fine-tuning, which is hugely expensive and needs dedicated time and effort to create datasets that are good but not contaminated. RAG is a technique to enrich a prompt, not to improve or extend a model: it does some pre-work before sending the request to the model&#8217;s completion API.</p>
<p>Input to the LLM: It&#8217;s important to clarify that a Large Language Model (LLM), whether behind an OpenAI-compatible API or not, doesn&#8217;t receive a pre-existing vector or embedding as input. Instead, it processes a combination of elements, such as a custom context, a system prompt, the user query and, optionally, an assistant prompt, all in the form of text, never a vector or any other form.</p>
<p>Purpose of embeddings and the vector database: Creating embeddings and setting up a vector database serve to enrich the knowledge base, never to train the LLM, which can&#8217;t be trained but only extended at huge cost (not applicable here). This knowledge base is built by capturing information before sending the context to the LLM, using what is called a RAG pipeline: a vector database fed by a loader (which splits and vectorizes the splits) and retrievers that try to find the best semantic matches for the user query.</p>
<p>Semantic search: The semantic search runs the user&#8217;s query against the vector database, and the results depend on the search algorithm applied. These results form the context that is subsequently sent to the LLM along with the user&#8217;s request and the other parts of the prompt. As you sometimes experienced, the retriever can fail to find the right semantic answers when the retrieval algorithm is not working well; I think you could try Contextual Compression or, my favourite, the multi-query retriever.</p>
<p>Role of embeddings: While creating embeddings does populate the vector database, it is crucial to create the right embeddings, and you didn&#8217;t mention how to split the knowledge that you want to use for augmentation. Splitting the wrong way, without overlap, and not using techniques like MMR will probably produce poor results.</p>
<p>Deployment considerations: You mentioned encountering challenges with using Lambdas for deployment. It&#8217;s worth noting that there are alternative deployment options much better than Lambdas. Solutions such as Bedrock, which facilitates the use of other LLMs that may be better tailored to your specific problem, are among my favorites. Additionally, depending on your requirements and constraints, you may consider employing larger models like Falcon, or chat-tuned ones that greatly reduce the sizing requirements.</p>
<p>I trust that these observations will be beneficial not only to you but also to your readers, including individuals like myself who deeply appreciate the work you have undertaken. Your dedication to the Salesforce arena is admirable, and I look forward to seeing more of your contributions in the future.<br />
Please don&#8217;t hesitate to reach out to me if you want to clarify any of these points; I would be honored.</p>
]]></content:encoded>
		
			</item>
		<item>
		<title>
		By: Jitendra		</title>
		<link>https://www.jitendrazaa.com/blog/salesforce/converting-salesforce-data-into-embeddings-with-openai-and-aws-lambda/#comment-118010</link>

		<dc:creator><![CDATA[Jitendra]]></dc:creator>
		<pubDate>Fri, 15 Dec 2023 00:10:29 +0000</pubDate>
		<guid isPermaLink="false">https://www.jitendrazaa.com/blog/?p=11191#comment-118010</guid>

					<description><![CDATA[Please make sure these libraries are installed using the pip command: openai, numpy, pandas, requests.]]></description>
			<content:encoded><![CDATA[<p>Please make sure these libraries are installed using the pip command: openai, numpy, pandas, requests.</p>
]]></content:encoded>
		
			</item>
		<item>
		<title>
		By: Yamini Machha		</title>
		<link>https://www.jitendrazaa.com/blog/salesforce/converting-salesforce-data-into-embeddings-with-openai-and-aws-lambda/#comment-118009</link>

		<dc:creator><![CDATA[Yamini Machha]]></dc:creator>
		<pubDate>Thu, 14 Dec 2023 23:55:04 +0000</pubDate>
		<guid isPermaLink="false">https://www.jitendrazaa.com/blog/?p=11191#comment-118009</guid>

					<description><![CDATA[Hi Sir, thanks for the blog post. I tried it but am getting this error:
cosine_similarity
    return np.dot(A, B) / (norm(A) * norm(B))
                           ~~~~~~~~^~~~~~~~~
TypeError: unsupported operand type(s) for *: &#039;rv_continuous_frozen&#039; and &#039;rv_continuous_frozen&#039;]]></description>
			<content:encoded><![CDATA[<p>Hi Sir, thanks for the blog post. I tried it but am getting this error:<br />
cosine_similarity<br />
    return np.dot(A, B) / (norm(A) * norm(B))<br />
                           ~~~~~~~~^~~~~~~~~<br />
TypeError: unsupported operand type(s) for *: &#8216;rv_continuous_frozen&#8217; and &#8216;rv_continuous_frozen&#8217;</p>
]]></content:encoded>
		
			</item>
	</channel>
</rss>
