A comparison of stemming techniques in tracing

David Farrar, Jane Huffman Hayes

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution > peer-review

11 Scopus citations

Abstract

We examine the effects of stemming on the tracing of software engineering artifacts. We compare two common stemming algorithms to each other as well as to a baseline of no stemming. We evaluate the algorithms on eight tracing datasets. We run the experiment using the TraceLab experimental framework to allow for ease of repeatability and knowledge sharing among the tracing community. We compare the algorithms on precision at recall levels of [0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9, 1.0], as well as on mean average precision values. The experiment indicated that neither the Porter stemmer nor the Krovetz stemmer outperformed the other on all datasets tested.
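The two evaluation measures named in the abstract can be sketched as follows. This is a minimal illustration, not the authors' TraceLab implementation: it assumes that each query artifact yields a ranked list of candidate links, that precision at a recall level is interpolated (the best precision reached at any recall at or above that level), and that mean average precision is the mean of per-query average precision. The abstract confirms none of these details, and all function names here are hypothetical.

```python
from typing import Dict, List, Sequence, Set, Tuple

# Recall levels listed in the abstract.
RECALL_LEVELS = [0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9, 1.0]


def precision_recall_points(ranked: Sequence[str], relevant: Set[str]) -> List[Tuple[float, float]]:
    """Return (recall, precision) measured at each relevant item in the ranking."""
    points = []
    hits = 0
    for rank, item in enumerate(ranked, start=1):
        if item in relevant:
            hits += 1
            points.append((hits / len(relevant), hits / rank))
    return points


def precision_at_recall_levels(
    ranked: Sequence[str], relevant: Set[str], levels: Sequence[float] = RECALL_LEVELS
) -> List[float]:
    """Interpolated precision (an assumption): best precision at any recall >= level."""
    points = precision_recall_points(ranked, relevant)
    result = []
    for level in levels:
        # Small tolerance guards against floating-point rounding in recall values.
        candidates = [p for r, p in points if r >= level - 1e-9]
        result.append(max(candidates, default=0.0))
    return result


def average_precision(ranked: Sequence[str], relevant: Set[str]) -> float:
    """Mean of the precision values at each relevant item; missed items count as 0."""
    if not relevant:
        return 0.0
    points = precision_recall_points(ranked, relevant)
    return sum(p for _, p in points) / len(relevant)


def mean_average_precision(
    rankings: Dict[str, Sequence[str]], truth: Dict[str, Set[str]]
) -> float:
    """Average the per-query average precision over all queries in the answer set."""
    scores = [average_precision(rankings[q], truth[q]) for q in truth]
    return sum(scores) / len(scores) if scores else 0.0


# Toy data, invented for illustration only (not taken from the paper's datasets).
rankings = {"req1": ["d1", "d2", "d3", "d4", "d5"], "req2": ["a", "b"]}
truth = {"req1": {"d1", "d4"}, "req2": {"b"}}

curve = precision_at_recall_levels(rankings["req1"], truth["req1"])
map_score = mean_average_precision(rankings, truth)
```

In a comparison like the one described, each stemming configuration (no stemming, Porter, Krovetz) would produce its own rankings, and the resulting curves and mean average precision values would be compared per dataset.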

Original language: English
Title of host publication: Proceedings - 2019 IEEE/ACM 10th International Workshop on Software and Systems Traceability, SST 2019
Pages: 37-44
Number of pages: 8
ISBN (Electronic): 9781728122557
State: Published - May 2019
Event: 10th IEEE/ACM International Workshop on Software and Systems Traceability, SST 2019 - Montreal, Canada
Duration: May 27 2019 → …

Publication series

Name: Proceedings - 2019 IEEE/ACM 10th International Workshop on Software and Systems Traceability, SST 2019

Conference

Conference: 10th IEEE/ACM International Workshop on Software and Systems Traceability, SST 2019
Country/Territory: Canada
City: Montreal
Period: 5/27/19 → …

Bibliographical note

Publisher Copyright:
© 2019 IEEE.

Keywords

  • Empirical research
  • Stemming
  • Traceability

ASJC Scopus subject areas

  • Software
  • Safety, Risk, Reliability and Quality
