Selma: A Semantic Local Code Search Platform

Research output: Contribution to book/Conference proceedings/Anthology/ReportConference contributionContributedpeer-review

Contributors

  • Anja Reusch - , Chair of Databases, TUD Dresden University of Technology (Author)
  • Guilherme C. Lopes - , TUD Dresden University of Technology (Author)
  • Wilhelm Pertsch - , Chair of Databases, TUD Dresden University of Technology (Author)
  • Hannes Ueck - , TUD Dresden University of Technology (Author)
  • Julius Gonsior - , Chair of Databases, TUD Dresden University of Technology (Author)
  • Wolfgang Lehner - , Chair of Databases, TUD Dresden University of Technology (Author)

Abstract

Searching for the right code snippet is cumbersome and not a trivial task. Online platforms such as Github.com or searchcode.com provide tools to search, but they are limited to publicly available and internet-hosted code. However, during the development of research prototypes or confidential tools, it is preferable to store source code locally. Consequently, the use of external code search tools becomes impractical. Here, we present Selma (Code and Videos: https://anreu.github.io/selma): a local code search platform that enables term-based and semantic retrieval of source code. Selma searches code and comments, annotates undocumented code to enable term-based search in natural language, and trains neural models for code retrieval.

Details

Original languageEnglish
Title of host publicationAdvances in Information Retrieval - 46th European Conference on Information Retrieval, ECIR 2024, Proceedings
EditorsNazli Goharian, Nicola Tonellotto, Yulan He, Aldo Lipani, Graham McDonald, Craig Macdonald, Iadh Ounis
PublisherSpringer Science and Business Media B.V.
Pages218-222
Number of pages5
ISBN (print)9783031560682
Publication statusPublished - 2024
Peer-reviewedYes

Publication series

SeriesLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume14612 LNCS
ISSN0302-9743

Conference

Title46th European Conference on Information Retrieval
Abbreviated titleECIR 2024
Conference number46
Duration24 - 28 March 2024
Website
LocationRadisson Blu Hotel
CityGlasgow
CountryUnited Kingdom

Keywords

Keywords

  • Code Retrieval, Transformer Models