Our approach
Our process began with research to identify the best technical architecture, quickly landing on the now commonly used RAG system as the ideal solution. We prioritized using AWS Bedrock to explore its capabilities for building enterprise-grade AI agents.
Initial Proof of Concept: We first built a version using AWS OpenSearch for the vector store. However, we discovered this option was prohibitively expensive for our use case.
Iteration & Optimization: We pivoted to a more cost-effective architecture using Pinecone as the vector database. This required creating a custom data pipeline: a script scrapes Confluence and pushes JSON files to an S3 bucket. The content was then converted into vector embeddings using Amazon's Titan Text Embeddings V2 model before being synced with Pinecone.
Versioning & Testing: The agent is managed with alias IDs in AWS Bedrock, allowing us to develop and test new versions of the agent internally without affecting the live chatbot.
The outcome
The project successfully demonstrated our ability to create secure, knowledgeable AI chatbots for internal use. The proof-of-concept evolved into a fully functional internal tool that has been rolled out for our team, validating the effectiveness of the RAG approach and the AWS Bedrock platform. As a result, the chatbot now fields common employee questions, significantly reducing the HR workload and freeing up the team to focus on more complex tasks.
Furthermore, we have established a reusable and cost-effective architectural pattern for connecting AI agents to any internal data source, such as PDFs, text or HTML files, by leveraging an S3 bucket as the central data source.
Key features
Natural Language Interface: Employees can ask questions in plain English and receive conversational, accurate answers.
Automated Knowledge Sync: A daily script ensures the chatbot's knowledge base is always up-to-date with the latest information from Confluence.
Source-Cited Answers: Every answer includes a direct link to the detailed Confluence page, ensuring transparency and providing a path to more comprehensive information.