Extracting mathematical semantics from documents

    Onderzoeksoutput: Hoofdstuk in Boek/Rapport/CongresprocedureConferentiebijdrageAcademicpeer review

    9 Citaten (Scopus)
    1 Downloads (Pure)

    Samenvatting

    We report on a project to use SGLR parsing and term rewriting with ELAN4 to extract the semantics of mathematical formulas from a TeX document and representing them in MathML. The TeX document we used is part of the Digital Library of Mathematical Functions (DLMF) project of the US National Institute of Standards and Technology (NIST) and obeys project-specific conventions, which contains macros for mathematical constructions, among them 200 predefined macros for special functions, the subject matter of the project. The SGLR parser can parse general context-free languages, which suffices to extract the structure of mathematical formulas from calculus that are written in the usual mathematical style, with most parentheses and multiplication signs omitted. The parse tree is then rewritten into a more concise and uniform internal syntax that is used as the base for extracting MathML or other semantical information.
    Originele taal-2Engels
    TitelPrinciples and Practice of Semantic Web Reasoning (Proceedings International Workshop, PPSWR 2003, Mumbai, India, December 8, 2003)
    RedacteurenF. Bry, N. Henze, J. Maluszynski
    Plaats van productieBerlin
    UitgeverijSpringer
    Pagina's160-173
    ISBN van geprinte versie3-540-20582-9
    DOI's
    StatusGepubliceerd - 2003

    Publicatie series

    NaamLecture Notes in Computer Science
    Volume2901
    ISSN van geprinte versie0302-9743

    Vingerafdruk

    Duik in de onderzoeksthema's van 'Extracting mathematical semantics from documents'. Samen vormen ze een unieke vingerafdruk.

    Citeer dit