Selma: A Semantic Local Code Search Platform

Publication: Contribution to book/conference proceedings/anthology/report > Contribution to conference proceedings > Contributed > Peer-reviewed

Contributors

Abstract

Searching for the right code snippet is a cumbersome, non-trivial task. Online platforms such as GitHub.com or searchcode.com provide search tools, but they are limited to publicly available, internet-hosted code. During the development of research prototypes or confidential tools, however, it is preferable to store source code locally, which makes external code search tools impractical. Here, we present Selma (Code and Videos: https://anreu.github.io/selma): a local code search platform that enables term-based and semantic retrieval of source code. Selma searches code and comments, annotates undocumented code to enable term-based search in natural language, and trains neural models for code retrieval.
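
To make the term-based part of this description concrete, the following is a minimal sketch of ranking local code snippets against a natural-language query. It is not Selma's implementation: the function names `tokenize` and `rank`, the TF-IDF-style weighting, and the two sample snippets are all assumptions chosen for illustration, and the semantic (neural) retrieval mentioned in the abstract is not covered.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Split code or natural language into lowercase terms.

    Identifiers are broken at underscores and camelCase boundaries, so
    `readCsvFile` and `read_csv_file` both yield read, csv, file.
    """
    terms = []
    for word in re.findall(r"[A-Za-z]+", text):
        # Split camelCase / PascalCase words; keep acronyms such as "CSV" whole.
        parts = re.findall(r"[A-Z]+(?![a-z])|[A-Z]?[a-z]+", word)
        terms.extend(part.lower() for part in parts)
    return terms


def rank(query, snippets):
    """Return (score, name) pairs for matching snippets, best match first.

    `snippets` maps a hypothetical file name to its source text. Code and
    comments are indexed together, so a comment can make code findable.
    """
    docs = {name: Counter(tokenize(source)) for name, source in snippets.items()}
    results = []
    for name, term_counts in docs.items():
        score = 0.0
        for term in set(tokenize(query)):
            if term_counts[term] == 0:
                continue
            # Terms that occur in fewer snippets contribute more to the score.
            doc_freq = sum(1 for counts in docs.values() if term in counts)
            score += term_counts[term] * math.log(1 + len(docs) / doc_freq)
        if score > 0:
            results.append((score, name))
    return sorted(results, reverse=True)


# Hypothetical local snippets, used only for illustration.
snippets = {
    "io_utils.py": """# Read a CSV file into a list of rows
def load(path):
    return open(path).read().splitlines()
""",
    "math_utils.py": """def computeMeanValue(xs):
    return sum(xs) / len(xs)
""",
}

for score, name in rank("read csv file", snippets):
    print(f"{name}: {score:.2f}")
```

In this sketch the first snippet is found only through its comment, which illustrates why annotating undocumented code can make it reachable by natural-language queries; the second is found because its camelCase identifier is split into separate terms.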

Details

Original language: English
Title: Advances in Information Retrieval - 46th European Conference on Information Retrieval, ECIR 2024, Proceedings
Editors: Nazli Goharian, Nicola Tonellotto, Yulan He, Aldo Lipani, Graham McDonald, Craig Macdonald, Iadh Ounis
Publisher: Springer Science and Business Media B.V.
Pages: 218-222
Number of pages: 5
ISBN (Print): 9783031560682
Publication status: Published - 2024
Peer-review status: Yes

Publication series

Series: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 14612 LNCS
ISSN: 0302-9743

Conference

Title: 46th European Conference on Information Retrieval
Short title: ECIR 2024
Event number: 46
Duration: 24 - 28 March 2024
Location: Radisson Blu Hotel
City: Glasgow
Country: United Kingdom

Keywords

  • Code Retrieval, Transformer Models