Using RetrieveChat Powered by Couchbase Capella for Retrieve Augmented Code Generation and Question Answering
Explore the use of AG2’s RetrieveChat for tasks like code generation from docstrings, answering complex questions with human feedback, and exploiting features like Update Context, custom prompts, and few-shot learning.
AG2 offers conversable agents powered by LLM, tool or human, which can be used to perform tasks collectively via automated chat. This framework allows tool use and human participation through multi-agent conversation. Please find documentation about this feature here.
RetrieveChat is a conversational system for retrieval-augmented code
generation and question answering. In this notebook, we demonstrate how
to utilize RetrieveChat to generate code and answer questions based on
customized documentations that are not present in the LLM’s training
dataset. RetrieveChat uses the AssistantAgent
and
RetrieveUserProxyAgent
, which is similar to the usage of
AssistantAgent
and UserProxyAgent
in other notebooks (e.g.,
Automated Task Solving with Code Generation, Execution &
Debugging).
Essentially, RetrieveUserProxyAgent
implement a different auto-reply
mechanism corresponding to the RetrieveChat prompts.
Table of Contents
We’ll demonstrate six examples of using RetrieveChat for code generation and question answering:
Some extra dependencies are needed for this notebook, which can be installed via pip:
For more information, please refer to the installation guide.
Setup Couchbase Capella Cluster
Before we proceed with the notebook. We will require a Couchbase Capella Database Cluster Running.
-
To setup a free operational cluster, Head over Couchbase Cloud and create an account. Over there create a free cluster. For more details on creating a cluster, refer here.
-
After creating the cluster, We will create our required bucket, scope, collections. Head over to Data Tools. On the left hand side panel, you will get an option to create a bucket. Give appropriate Bucket Name, Scope Name, Collection Name. For this tutorial, set
- Bucket Name : test_db
- Scope Name: test_scope
- Collection Name: demo_collection
-
Now, we will connect to the cluster. Refer this page for connection
-
Next, let’s create user to connect. Head over to settings tab. Click [Create Cluster Access]. Specify a username and password and give read/write all buckets access. You may create more users with less than admin access. For more information, refer here
-
Add IP Address to allowed IPs. in settings, click on [Networking]. Add allowed IP based on your requirements.
-
Let’s fill our environment variables. First get the connection string from the [Connect] Tab. Now we will have 3 envs for Couchbase cluster and one for open AI
- CB_CONN_STR : Couchbase Cluster Connection string
- CB_USERNAME : Username of the user, created above
- CB_PASSWORD : Password of the user, created above
- OPENAI_API_KEY : OpenAI API Key which agents will require.
Voila, your cluster is ready to be used.
Set your API Endpoint
Learn more about configuring LLMs for agents here.
Construct agents for RetrieveChat
We start by initializing the AssistantAgent
and
RetrieveUserProxyAgent
. The system message needs to be set to “You are
a helpful assistant.” for AssistantAgent. The detailed instructions are
given in the user message. Later we will use the
RetrieveUserProxyAgent.message_generator
to combine the instructions
and a retrieval augmented generation task for an initial prompt to be
sent to the LLM assistant.
Example 1
Use RetrieveChat to help generate sample code and automatically run the code and fix errors if there is any.
Problem: Which API should I use if I want to use FLAML for a classification task and I want to train the model in 30 seconds. Use spark to parallel the training. Force cancel jobs if time limit is reached.
Note: You may need to create an index on the cluster to query