Setting goals and choosing metrics for recommender system evaluations

Research output: Contribution to book/Conference proceedings/Anthology/ReportConference contributionContributedpeer-review

Contributors

Abstract

Recommender systems have become an important personalization technique on the web and are widely used especially in e-commerce applications. However, operators of web shops and other platforms are challenged by the large variety of available algorithms and the multitude of their possible parameterizations. Since the quality of the recommendations that are given can have a significant business impact, the selection of a recommender system should be made based on well-founded evaluation data. The literature on recommender system evaluation o?ers a large variety of evaluation metrics but provides little guidance on how to choose among them. This paper focuses on the often neglected aspect of clearly defining the goal of an evaluation and how this goal relates to the selection of an appropriate metric. We discuss several well-known accuracy metrics and analyze how these reflect di?erent evaluation goals. Furthermore we present some less well-known metrics as well as a variation of the area under the curve measure that are particularly suitable for the evaluation of recommender systems in e-commerce applications.

Details

Original languageEnglish
Title of host publicationJoint Workshop on Human Decision Making in Recommender Systems, Decisions@RecSys 2011 and User-Centric Evaluation of Recommender Systems and Their Interfaces-2, UCERSTI 2 - Affiliated with the 5th ACM Conference on Recommender Systems, RecSys 2011
Pages78-85
Number of pages8
Volume811
Publication statusPublished - 2011
Peer-reviewedYes

Publication series

SeriesCEUR Workshop Proceedings
ISSN1613-0073

Conference

TitleJoint Workshop on Human Decision Making in Recommender Systems, Decisions@RecSys 2011 and User-Centric Evaluation of Recommender Systems and Their Interfaces-2, UCERSTI 2 - Affiliated with the 5th ACM Conference on Recommender Systems, RecSys 2011
Duration23 - 26 October 2011
CityChicago, IL
CountryUnited States of America

External IDs

ORCID /0000-0001-8107-2775/work/199961306

Keywords

Research priority areas of TU Dresden

DFG Classification of Subject Areas according to Review Boards

Subject groups, research areas, subject areas according to Destatis

ASJC Scopus subject areas

Keywords

  • Area under the curve, Auc, E-commerce, Evaluation, Informedness, Markedness, Matthews correlation, Measure, Metrics, Precision, Recall, Recommender systems, Roc