Student Engagement Dataset (SED): An Online Learning Activity Dataset

Document Type

Article

Publication Date

1-1-2025

Abstract

Distance learning has become a popular educational medium, and the Internet has spread since the early 2000s. To leverage this phenomenon, learning analytics and data mining can provide insights into improving pedagogy and assessing student engagement. To this end, a student-centric dataset was constructed by extracting data from Universiti Malaya's Moodle-based Virtual Learning Environment (VLE), which serves approximately 25,000 students annually. In this paper, we present the Student Engagement Dataset (SED). The dataset consists of 16,609 students and 2,407 courses. It contains information such as grades and daily logged online activities (approximately 12 million data points), including temporal data across four tables. The tables include student engagement features created by aggregating raw activity data. Here, we present the dataset's properties and describe the data collection, selection, and processing steps. Correlation analysis of student engagement features showed a statistically significant but weak negative correlation between the number of courses, early morning logins, assignments, and top students' performance. SED is expected to present new opportunities for researchers in the learning analytics domain.

Keywords

Learning analytics, learning management systems (LMSs), online learning, online learning, virtual learning environments (VLEs), virtual learning environments (VLEs), student engagement, student engagement, student engagement

Divisions

ai,CHEMISTRY

Funders

Universiti Malaya (GPF009D-2019)

Publication Title

IEEE Access

Volume

13

Publisher

Institute of Electrical and Electronics Engineers

Publisher Location

445 HOES LANE, PISCATAWAY, NJ 08855-4141 USA

This document is currently not available here.

Share

COinS