Using Artificial Intelligence to pull summarized content from large document collections
Overview
The Digital Transformation Hub at Cal Poly (DxHub), in partnership with Amazon Web Services (AWS), has collaborated with the Data Lab team at the World Bank to create and launch an innovative AI-powered chatbot named ‘Pluto’. This chatbot distills over 75 years of developmental knowledge encapsulated within thousands of publicly accessible reports. Pluto enables development teams to efficiently access advice and insights derived from previous development projects, such as those focused on wastewater and water treatment. For instance, by querying Pluto with a request like “give me the top 3 lessons learned on clean water projects in Africa”, users receive synthesized, high-quality responses along with links to the source documents for in-depth exploration. This approach maximizes the strengths of AI technology by leveraging easier search mechanisms and the summarization of aggregate data while mitigating hallucinations.
Problem
Historically, governmental project teams in countries receiving World Bank loans have faced challenges in accessing comprehensive lessons from past development projects. The vast public documentation available was cumbersome to navigate, making it a daunting task to extract relevant lessons and advice applicable to new initiatives. The solution being developed aims to overcome this barrier by offering an easy-to-use, interactive interface. This interface is built atop an intelligent search engine that probes the extensive archives, records, and data of the World Bank, streamlining access to invaluable insights. For the purpose of the pilot, the team worked with a subset of data that focused on water quality projects.
Innovation in action
Solution
About the DxHub
The Cal Poly Digital Transformation Hub (DxHub) is a strategic relationship with Amazon Web Services (AWS) and is the world’s first cloud innovation center supported by AWS on a University campus. The primary goal of the DxHub is to provide real-world problem-solving experiences to students by immersing them in the application of proven innovation methods in combination with the latest technologies to solve important challenges in the public sector. The challenges being addressed cover a wide variety of topics including homelessness, evidence-based policing, digital literacy, virtual cybersecurity laboratories and many others. The DxHub leverages the deep subject matter expertise of government, education and non-profit organizations to clearly understand the customers affected by public sector challenges and develops solutions that meet the customer needs.