Virtual Projects

Evaluating automated or artificial intelligence search tools for evidence synthesis


Robin Featherstone1


doi: http://dx.doi.org/10.5195/jmla.2026.2341

Volume 114, Number 2: 171-172
Received 10 2025; Accepted 12 2025

ABSTRACT

To advance information retrieval science for producing evidence syntheses at Canada's Drug Agency, the Research Information Services team developed a replicable process to evaluate automated or artificial intelligence (AI) search tools. The team inventoried 51 tools in the fall of 2023 and built a flexible evaluation instrument to inform adoption decisions and enable comparison between tools. Building on this foundational evaluation work, the team further conducted a comparative analysis on three top-ranked tools in the fall of 2024. The investigation confirmed that these automated or AI tools have inconsistent and variable performance for the range of information retrieval tasks performed by Information Specialists at Canada's Drug Agency. Implementation recommendations from this study informed a “fit for purpose” approach where Information Specialists leverage automated or AI search tools for specific tasks or project types.

Keywords: Information Sciences; Information Storage and Retrieval; Review Literature as Topic; Artificial Intelligence; Generative Artificial Intelligence; Large Language Models; Automation.

Context, aims, and significance of the virtual project: Canada’s Drug Agency is a pan-Canadian health organization that provides independent evidence and advice so health system leaders can make informed drug, health technology, and health system decisions. The Research Information Services (RIS) team at Canada’s Drug Agency consists of masters in library and information science (MLIS)-credentialled Information Specialists (ISs) who conduct systematic and abbreviated literature searches for evidence reports produced by the agency.

Given the volume of untested and rapidly evolving automated or Artificial Intelligence (AI) tools for information retrieval, RIS initiated a project to identify, evaluate, and recommend tools to incorporate into IS workflows. Canada’s Drug Agency funded this work, and the RIS project team consisted of one manager, six ISs, a statistician, and an IS methodologist.

Through this work, the RIS team developed a sustainable evaluation process to stay abreast of emerging search technologies and a customizable instrument to be used by other evidence synthesis producers. This evaluation work was foundational to the subsequent conduct of a comparative analysis of automated or AI search tools and the development of implementation recommendations for Canada’s Drug Agency and all evidence synthesis producers.

Brief description of the virtual project: For the primary project phase, RIS compiled an inventory of 51 search tools, assessed and ranked these tools, and built a flexible evaluation instrument to prioritize tools for further testing. RIS also conducted a comparative analysis of three top-rated tools from the team’s evaluations and published recommendations that showed the variable performance of AI search tools and identified cases where their use was warranted for information retrieval tasks.

Technology used: RIS’ inventory relied on curated collections of search tools: the Systematic Review Toolbox, and York Health Economics Consortium’s list of citation analysis tools. RIS supplemented the inventory with tools identified through a weekly research feed generated from a Semantic Scholar research library.

RIS built our inventory and our evaluation instrument using Microsoft Excel. The comparative analysis for this project tested Lens.org, a metadata aggregator of scholarly and patent literature; SpiderCite, a citation searching tool now available through the TERA suite of automation tools for systematic reviews; and Microsoft Copilot, a generative AI chatbot built upon OpenAI’s GPT-4 LLM5.

Advantages, limitations, and impact: An advantage to the evaluation instrument is that it allowed RIS to compare tools as they become available and to prioritize tools that the team determined to be the most likely to benefit their work. A limitation of this work is that the evaluation criteria did not account for potential sources of bias in the tools (e.g., language bias). The project team acknowledges the need for ISs to interrogate sources of biases and for tool developers to report these transparently. Despite this limitation, the flexible evaluation instrument may be modified to account for additional criteria by users – including potential sources of bias. With the development of evaluation approaches and implementation recommendations from testing, RIS’ work enables judicious use of innovative AI and automated technologies to enhance information retrieval for Canada’s Drug Agency and other evidence synthesis producers.

Contact information: requests@cda-amc.ca.

LINKS

Virtual Projects are published on an annual basis in the Journal of the Medical Library Association (JMLA) following an annual call for virtual projects in MLAConnect and announcements to encourage submissions from all types of libraries. An advisory committee of recognized technology experts selects project entries based on their currency, innovation, and contribution to health sciences librarianship.

REFERENCES

1. Canada’s Drug Agency. Development of an evaluation instrument on artificial intelligence search tools for evidence synthesis. Can J Health Technol. 2024:4(10). DOI: https://doi.org/10.51731/cjht.2024.1004.

2. Featherstone RM, Walter M, MacDougall D, Morenz E, Bailey S, Butcher R, Ford C, Loshak H, Kaunelis D. Artificial intelligence search tools for evidence synthesis: comparative analysis and implementation recommendations. Cochrane Evid Synth Methods. 2025:3:e70045. DOI: https://doi.org/10.1002/cesm.70045.


Robin Featherstone, 1 robin.featherstone@cda-amc.ca, Acting Director, Research Information Services, Canada's Drug Agency, Ottawa, Ontario


© 2026 Robin Featherstone

This work is licensed under a Creative Commons Attribution 4.0 International License.



Journal of the Medical Library Association, VOLUME 114, NUMBER 2, April 2026