Systematic Evaluation of Similarity Metrics for Retrieval, Reranking, and Completion in Retrieval Augmented Generation Systems

Elkıran, Harun; Rasheed, Jawad

doi:10.1109/ETECOM66111.2025.11319066

dc.authorscopusid	59149323900
dc.authorscopusid	57791962400
dc.contributor.author	Elkıran, Harun
dc.contributor.author	Rasheed, Jawad
dc.contributor.department-temp
dc.date.accessioned	2026-04-14T20:49:17Z
dc.date.issued	2025
dc.department	Faculty of Engineering and Natural Sciences
dc.description	2025 IEEE International Conference on Emerging Trends in Engineering and Computing (ETECOM) / IEEE -- ISBN:979-833156616-6 -- 2025.
dc.description.abstract	Two of the major problems with large language models (LLMs) are hallucinations and out-of-context responses. Retrieval Augmented Generation (RAG) has emerged as a promising approach to these problems because it grounds the output of LLMs in external knowledge. The effectiveness of a RAG pipeline depends on several factors, including the choice of similarity metric. This paper presents a systematic evaluation of a complete RAG pipeline that uses the Milvus vector database with HNSW indexing in conjunction with OpenAI's embedding models and GPT-based completion. We conducted a comparative analysis of three widely used similarity metrics (Cosine, Inner Product, and L2) under identical conditions. The results show that retrieval and reranking performance is highly sensitive to the choice of similarity metric. Cosine and Inner Product consistently achieve substantially higher recall (R@10 = 0.9092-0.925), Mean Reciprocal Rank (MRR = 0.7806-0.7930), and nDCG (nDCG@10 = 0.8121-0.8252) than L2. In contrast, completion-stage metrics such as token usage, cost, and latency remain largely unaffected by the choice of metric. These results underscore the crucial role of retrieval similarity functions in determining RAG effectiveness.
dc.identifier.citation	Elkiran, H., & Rasheed, J. (2025). Systematic evaluation of similarity metrics for retrieval, reranking, and completion in retrieval augmented generation systems. In 2025 IEEE International Conference on Emerging Trends in Engineering and Computing (ETECOM) (pp. 1–5). IEEE. https://doi.org/10.1109/ETECOM66111.2025.11319066
dc.identifier.doi	10.1109/ETECOM66111.2025.11319066
dc.identifier.endpage	5
dc.identifier.isbn	979-833156616-6
dc.identifier.orcid	0000-0002-5834-6210
dc.identifier.orcid	0000-0003-3761-1641
dc.identifier.scopus	2-s2.0-105033366763
dc.identifier.startpage	1
dc.identifier.uri	https://doi.org/10.1109/ETECOM66111.2025.11319066
dc.identifier.uri	https://hdl.handle.net/20.500.12436/9402
dc.indekslendigikaynak	Scopus
dc.language.iso	en
dc.publisher	Institute of Electrical and Electronics Engineers Inc.
dc.relation.ispartof	2025 IEEE International Conference on Emerging Trends in Engineering and Computing, ETECOM 2025
dc.relation.publicationcategory	Conference Item - International - Institutional Faculty Member
dc.rights	info:eu-repo/semantics/closedAccess
dc.subject	HNSW Indexing
dc.subject	Reranking
dc.subject	Retrieval-Augmented Generation
dc.subject	Similarity Metrics
dc.subject	Vector Databases
dc.title	Systematic Evaluation of Similarity Metrics for Retrieval, Reranking, and Completion in Retrieval Augmented Generation Systems
dc.type	Conference Object
dspace.entity.type	Publication
relation.isAuthorOfPublication	f9b9b46c-d923-42d3-b413-dd851c2e913a
relation.isAuthorOfPublication.latestForDiscovery	f9b9b46c-d923-42d3-b413-dd851c2e913a
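
The abstract names three similarity metrics (Cosine, Inner Product, L2) and three ranking measures (R@10, MRR, nDCG@10) without defining them. The following is a minimal sketch of the standard textbook definitions, assuming binary relevance judgments; the paper's exact evaluation setup is not given in this record. All function names are illustrative and come neither from the paper nor from the Milvus API.

```python
import math


def dot(a, b):
    """Inner product: larger means more similar."""
    return sum(x * y for x, y in zip(a, b))


def cosine(a, b):
    """Cosine similarity: inner product of the length-normalized vectors."""
    return dot(a, b) / (math.sqrt(dot(a, a)) * math.sqrt(dot(b, b)))


def l2_distance(a, b):
    """Euclidean (L2) distance: smaller means more similar."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def recall_at_k(ranked_ids, relevant_ids, k=10):
    """Fraction of the relevant items that appear in the top k results."""
    hits = len(set(ranked_ids[:k]) & set(relevant_ids))
    return hits / len(relevant_ids)


def reciprocal_rank(ranked_ids, relevant_ids):
    """1 / rank of the first relevant item, or 0.0 if none is retrieved.
    MRR is the mean of this value over all queries."""
    for rank, doc_id in enumerate(ranked_ids, start=1):
        if doc_id in relevant_ids:
            return 1.0 / rank
    return 0.0


def ndcg_at_k(ranked_ids, relevant_ids, k=10):
    """nDCG@k with binary gains (assumption: relevance is 0 or 1)."""
    dcg = sum(
        1.0 / math.log2(rank + 1)
        for rank, doc_id in enumerate(ranked_ids[:k], start=1)
        if doc_id in relevant_ids
    )
    ideal_hits = min(len(relevant_ids), k)
    idcg = sum(1.0 / math.log2(rank + 1) for rank in range(1, ideal_hits + 1))
    return dcg / idcg if idcg > 0 else 0.0


if __name__ == "__main__":
    # Hypothetical toy data: one query vector and one ranked result list.
    query = [0.6, 0.8]
    doc = [3.0, 4.0]  # same direction as the query, different length
    print("inner product:", dot(query, doc))
    print("cosine:", cosine(query, doc))
    print("L2 distance:", l2_distance(query, doc))

    ranked = ["c", "b", "a"]  # hypothetical retrieval order
    relevant = {"b"}          # hypothetical ground truth
    print("recall@10:", recall_at_k(ranked, relevant))
    print("reciprocal rank:", reciprocal_rank(ranked, relevant))
    print("nDCG@10:", ndcg_at_k(ranked, relevant))
```

For vectors of unit length the cosine and inner product scores are identical by definition, which is consistent with the two metrics reporting nearly the same retrieval figures in the abstract.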

Collection

Computer Engineering Department Collection
Scopus Indexed Publications Collection