Document Type
Article
Publication Date
3-26-2024
Department
Department of Computer Science
Abstract
While the rise of large language models (LLMs) has created rich new opportunities to learn about digital technology, many on the margins of this technology struggle to gain and maintain competency due to lexical or conceptual barriers that prevent them from asking appropriate questions. Although there have been many efforts to understand factuality of LLM-created content and ability of LLMs to answer questions, it is not well understood how unclear or nonstandard language queries affect the model outputs. We propose the creation of a dataset that captures questions of digital newcomers and outsiders, utilizing data we have compiled from a decade's worth of one-on-one tutoring. In this paper we lay out our planned efforts and some potential uses of this dataset.
Publication Title
arXiv
Recommended Citation
Lucas, E.,
Steelman, K. S.,
Ureel, L.,
&
Wallace, C.
(2024).
For those who don't know (how) to ask: Building a dataset of technology questions for digital newcomers.
arXiv.
http://doi.org/10.48550/arXiv.2403.18125
Retrieved from: https://digitalcommons.mtu.edu/michigantech-p2/1193
Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 International License.
Version
Publisher's PDF
Publisher's Statement
© 2024. Publisher’s version of record: https://doi.org/10.48550/arXiv.2403.18125