A data set of OCL expressions on GitHub

J.F.H. Noten, J.G.M. Mengerink, A. Serebrenik

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Academic › peer-review

15 Citations (Scopus)


In model-driven engineering (MDE), meta-models are the central artifacts. As a complement, the Object Constraint Language (OCL) is a language used to express constraints and operations on meta-models. The Eclipse Modeling Framework (EMF) provides an implementation of OCL, enabling OCL-annotated meta-models. Existing empirical studies of the OCL have been conducted on small collections of data. To facilitate empirical research into the OCL on a larger scale, we present the first publicly available data set of OCL expressions. The data set contains 9188 OCL expressions originating from 504 EMF meta-models in 245 systematically selected GitHub repositories. Both the original meta-models and the generated abstract syntax trees are included, allowing for a variety of empirical studies of the OCL. To illustrate the applicability of this data set in practice, we performed three case studies.
Original language: English
Title of host publication: Proceedings - 2017 IEEE/ACM 14th International Conference on Mining Software Repositories, MSR 2017
Place of Publication: Piscataway
Publisher: Institute of Electrical and Electronics Engineers
Number of pages: 4
ISBN (Electronic): 978-1-5386-1544-7
ISBN (Print): 978-1-5386-1545-4
Publication status: Published - 29 Jun 2017
Event: 14th International Conference on Mining Software Repositories (MSR 2017) - Buenos Aires, Argentina
Duration: 20 May 2017 - 21 May 2017
Conference number: 14


Conference: 14th International Conference on Mining Software Repositories (MSR 2017)
Abbreviated title: MSR 2017
City: Buenos Aires


  • GitHub
  • OCL
  • data set
