In the Technology Enhanced Learning (TEL) community, the problem of conducting reproducible evaluations of rec-ommender systems is still open, due to the lack of exhaustive benchmarks. The few public datasets available in TEL have limitations, being mostly small and local. Recently, Massive Open Online Courses (MOOC) are attracting many studies in TEL, mainly because of the huge amount of data for these courses and their potential for many applications in TEL. This paper presents DAJEE, a dataset built from the crawling of MOOCs hosted on the Coursera platform. DAJEE offers information on the usage of more than 20,000 resources in 407 courses by 484 instructors, with a conjunction of different educational entities in order to store the courses' structure and the instructors' teaching experiences.
Estivill Castro, V., Limongelli, C., Lombardi, M., Marani, A. (2016). DAJEE: A dataset of joint educational entities for information retrieval in technology enhanced learning. In SIGIR 2016 - Proceedings of the 39th International ACM SIGIR Conference on Research and Development in Information Retrieval (pp.681-684) [10.1145/2911451.2914670].
DAJEE: A dataset of joint educational entities for information retrieval in technology enhanced learning
LIMONGELLI, Carla;LOMBARDI, MATTEO;MARANI, ALESSANDRO
2016-01-01
Abstract
In the Technology Enhanced Learning (TEL) community, the problem of conducting reproducible evaluations of rec-ommender systems is still open, due to the lack of exhaustive benchmarks. The few public datasets available in TEL have limitations, being mostly small and local. Recently, Massive Open Online Courses (MOOC) are attracting many studies in TEL, mainly because of the huge amount of data for these courses and their potential for many applications in TEL. This paper presents DAJEE, a dataset built from the crawling of MOOCs hosted on the Coursera platform. DAJEE offers information on the usage of more than 20,000 resources in 407 courses by 484 instructors, with a conjunction of different educational entities in order to store the courses' structure and the instructors' teaching experiences.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.