DocsGPT: Migrating One of the Industry’s Most Popular Open Source AI Assistants to Atlas Vector Search

Mat Keep
February 6, 2024 | Updated: February 21, 2024
#genAI #Vector Search

Since its founding in 2019, Arc53 has focused on building predictive AI/ML solutions for its clients, with use cases ranging from recommendation engines to fraud detection. But it was with OpenAI’s launch of ChatGPT in November 2022 that the company saw AI rapidly take a new direction.

As Arc53 co-founder Alex Tushynski explains, “It was no surprise to see generative AI suddenly capture market attention. Suddenly developers and data teams were being challenged to bring their companies’ own proprietary data to gen AI models, in what we now call retrieval-augmented generation (RAG). But this involved them building new skills and disciplines. It wasn’t easy as they had to stitch together all of their different databases, data lakes, file systems, and search engines, and then figure out how to feed data from those systems into their shiny new vector stores. Then they had to orchestrate all of these components to build a complete solution. We identified an opportunity to abstract this complexity away from them. So DocsGPT was born.”

DocsGPT is an open-source documentation assistant that makes it easy for developers to build conversational user experiences with natural language processing (NLP) directly on top of their data. That can be a chatbot on a company website for customer support or an interface into internal data repositories that helps boost employee productivity.

Developers simply connect their data sources to DocsGPT and experiment with different embedding models and large language models (LLMs) to optimize for their specific use case. LLM options currently include OpenAI’s GPT-3.5 and GPT-4, along with DocsGPT-7B, which is based on Mistral.

In addition to the choice of models, developers can choose where they deploy DocsGPT. They can download the open source code to run in their own environment or consume DocsGPT as a managed service from Arc53.

The freedom developers enjoy with DocsGPT is reflected in its adoption. Since its release last year, the project has accumulated close to 14,000 GitHub stars and built a vibrant community with over 100 independent contributors. Tushynski says, “DocsGPT counts the UK government’s Department for Work and Pensions, pharmaceutical industry solution provider NoDeviation, and nearly 20,000 other users.”

Tushynski and team selected MongoDB Atlas as the database for the DocsGPT managed service. “We’ve used MongoDB in many of our prior predictive AI projects. Its flexibility to store data of any structure, scale to huge data sets, and ease of use for both developers and data scientists means we can deliver richer AI-driven solutions faster. Using it to underpin DocsGPT was an obvious choice. As developers connect their documentation to DocsGPT, MongoDB stores all of the metadata, along with chat history and user account information.”

Migrating from Elasticsearch to MongoDB Atlas Vector Search

With the release of Atlas Vector Search, the DocsGPT team is now migrating its vector database from Elasticsearch into MongoDB Atlas. Tushynski says, “MongoDB is a proven OLTP database handling high read and write throughput with transactional guarantees. Bringing these capabilities to vector search and real-time gen AI apps is massively valuable. Atlas is able to handle highly dynamic workloads with rapidly changing embeddings in ways Elasticsearch cannot. The latency Elasticsearch exhibits as it merges updates into existing indexes means the app is often retrieving stale data, impacting the quality and reliability of model outputs.”

Tushynski goes on to say, “We’ve experimented with a number of standalone vector databases. There are some good technologies there, but again, they don’t meet our needs when working with highly dynamic gen AI apps. We often see users wanting to change embedding models as their apps evolve, a process that means re-encoding the data and updating the vector search index. For example, we’ve migrated our own default embedding models from OpenAI to multiple open-source models hosted on Hugging Face and now to BGE. MongoDB’s OLTP foundations make this a fast, simple, and hassle-free process.”
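
The re-encoding step described above can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not DocsGPT's actual code: the field names (`text`, `embeddings`), the model label `bge`, and the `toy_embed` function are hypothetical stand-ins, and the update documents are only built here, not sent to a database.

```python
from typing import Callable, Dict, List

Vector = List[float]


def toy_embed(text: str) -> Vector:
    """Hypothetical stand-in for a real embedding model (returns 3 numbers)."""
    return [float(len(text)), float(text.count(" ")), float(sum(map(ord, text)) % 97)]


def build_reembed_updates(
    docs: List[Dict],
    model_name: str,
    embed: Callable[[str], Vector],
) -> List[Dict]:
    """Build one update per document that adds a vector for the new model.

    Each entry pairs a filter with a $set update document.
    Vectors stored under other model names are left untouched.
    """
    updates = []
    for doc in docs:
        updates.append(
            {
                "filter": {"_id": doc["_id"]},
                # Dotted path: writes only the sub-field for this model.
                "update": {"$set": {f"embeddings.{model_name}": embed(doc["text"])}},
            }
        )
    return updates


# Hypothetical source documents; only _id and text are assumed to exist.
docs = [
    {"_id": 1, "text": "How do I install DocsGPT?"},
    {"_id": 2, "text": "Configuring the vector store"},
]
updates = build_reembed_updates(docs, "bge", toy_embed)
```

With PyMongo, each entry could then be applied with `collection.update_one(u["filter"], u["update"])`, and the vector search index would need to be updated to cover the new field path. Writing the new vectors next to the old ones, rather than overwriting them, is a design choice of this sketch that lets both models be queried during a switchover.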

The unification and synchronization of source data, metadata, and vector embeddings in a single platform, accessed by a single API, makes building gen AI apps faster, with lower cost and complexity.
Alex Tushynski, co-founder, Arc53

Tushynski discusses the importance of embedding models in his blog post, Amplify DocsGPT with optimal embeddings. The post includes an example of how one customer was able to improve measured user experience by 50% simply by updating their embedding model.

Figure 2: *Demonstrating the impact of vector embedding choices*

“One of the standout features of MongoDB Atlas in this context is its adeptness in handling multiple embeddings. The ability to link various embeddings directly with one or more LLMs without the necessity for separate collections or tables is a powerful feature,” Tushynski says. “This approach not only streamlines the data architecture but also eliminates the need for data duplication, a common challenge in traditional database setups. By facilitating the storage and management of multiple embeddings, it allows for a more seamless and flexible interaction between different LLMs and their respective embeddings.”
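
One way to picture storing several embeddings side by side is a single document that carries a vector per model, with queries aimed at whichever field matches the model in use. The sketch below is illustrative only: the document shape, the model labels `model_a` and `model_b`, and the index name `docs_vectors` are assumptions, and it also assumes one index definition may cover several vector paths. It builds the index definition and a `$vectorSearch` aggregation stage as plain dictionaries without connecting to Atlas.

```python
from typing import Dict, List


def vector_index_definition(dims_by_path: Dict[str, int]) -> Dict:
    """Build a vector search index definition with one entry per vector field."""
    return {
        "fields": [
            {"type": "vector", "path": path, "numDimensions": dims, "similarity": "cosine"}
            for path, dims in dims_by_path.items()
        ]
    }


def vector_search_stage(index: str, path: str, query_vector: List[float], limit: int = 5) -> Dict:
    """Build a $vectorSearch aggregation stage against one embedding field."""
    return {
        "$vectorSearch": {
            "index": index,
            "path": path,
            "queryVector": query_vector,
            # Consider more candidates than results returned (20x is an arbitrary choice).
            "numCandidates": limit * 20,
            "limit": limit,
        }
    }


# One document holds the source text plus a vector per embedding model.
# The vectors are tiny toy values; real models produce hundreds of dimensions.
doc = {
    "_id": 1,
    "text": "How do I install DocsGPT?",
    "embeddings": {"model_a": [0.1, 0.2, 0.3], "model_b": [0.4, 0.5, 0.6, 0.7]},
}

index_def = vector_index_definition(
    {f"embeddings.{name}": len(vec) for name, vec in doc["embeddings"].items()}
)
stage = vector_search_stage("docs_vectors", "embeddings.model_b", [0.4, 0.5, 0.6, 0.7], limit=3)
```

The stage could be passed to an aggregation pipeline, for example `collection.aggregate([stage])` in PyMongo. Switching models then means changing the `path` value rather than copying the source text into another collection.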

As part of the MongoDB AI Innovators program, the DocsGPT engineering team receives free Atlas credits as well as access to technical expertise to support its migration. The AI Innovators program is open to any startup that is building AI with MongoDB.

Check out our AI resource page to learn more about building AI-powered apps with MongoDB.

