This is an old revision of the document!


A large-scale comparative study of beta testers and standard users [CACM 2018]

   Authors: Vlasta Stavova, Lenka Dedkova, Martin Ukrop and Vashek Matyas

 Primary contact: Vlasta Stavova <vlasta.stavova@mail.muni.cz>

 Journal: Communications of the ACM

Abstract

There have been many studies exposing poor usability of security software for the common end user. However, only a few inspect the usability challenges faced by more knowledgeable users. We conducted an experiment to empirically assess usability of the command line interface of OpenSSL, a well known and widely used cryptographic library. Based on the results, we try to propose specific improvements that would encourage more secure behavior. We observed 87 developers/administrators at two certificate-related tasks in a controlled environment. Furthermore, we collected participant opinions on both the tool interface and available documentation. Based on the overall results, we deem the OpenSSL usability insufficient according to both user opinions and standardized measures. Moreover, the perceived usability seems to be correlated with previous experience and used resources. There was a great disproportion between the participant views of a successful task accomplishment and the reality. A general dissatisfaction with both OpenSSL interface and its manual page was shared among the majority of the participants. As hinted by a participant, OpenSSL gradually “turned into a complicated set of sharp kitchen knives” – it can perform various jobs very well, but laymen risk stabbing themselves in the process. This highlights the necessity of a usable design even for tools targeted at experienced users.

Pre-print PDF   BiBTeX

@Article{2017-cacm-stavova,
  title = {A large-scale comparative study of beta testers and standard users},
  author = {Vlasta Stavova and Lenka Dedkova and Martin Ukrop and Vashek Matyas},
  journal = {To appear in Communications of ACM},
  year = {2018},
  keywords = {usablesec, ESET},
  organization = {ACM},
}

Our research produced the following actionable takeaways for software developers:

  • Using data
    Data you can collect can help you learn who your users and beta testers are. Consider country of origin, software and hardware configuration, and basic demographics.
  • Selecting testers
    The fewer testers you have, the pickier you should be about their selection.
  • Identifying usability issues
    When testing international products, ensure beta testers are culturally representative of regular users to help identify potential localization and cultural usability issues.
  • Ensuring representation
    Most important, testers should be representative of regular users. Keep checking that this is the case or pursue additional rigorous analyses to reach the most credible and applicable conclusions possible.

Beta testers are the first end users outside a software company to use its product. They have been used for decades and are rightly credited not only with finding and reporting bugs, but also with improving general product usability through their feedback and/or the ways they use the product.

In this paper, we investigate whether beta testers represent standard users well enough to allow for the extrapolation of testing data to standard users. We have investigated records of beta testers and standard users of home security solution developed by the IT security software provider ESET. With more than 600 000 participants from more than 180 countries, we present what we believe to be the first large-scale comparison between standard users and beta testers.

We compared several aspects of both populations, such as hardware, operating system, country of origin and EULA reading time, all taken from system data. Other attributes, such as age, gender, privacy perception and computer proficiency self-evaluation, were available thanks to a user questionnaire.

We conclude that - at least in our study - beta users represent standard users well in terms of hardware and operating system in large scale beta testing. However, populations differ significantly in the distribution of users and testers between countries. This may cause a problem when a testing includes localization and usability issues that may be influenced by regional differences.

  • Diploma thesis based on localization data from this project (in czech only). It compares free available online GeoIP databases on a large sample of available IP adresses.

We thank Masaryk University (MUNI/M/1052/2013) and Miroslav Bartosek for support, and to anonymous reviewers and Vit Bukac for valuable feedback.