Learning Tree Pattern TransformationsShow others and affiliations
2025 (English)In: 28th International Conference on Database Theory: ICDT 2025, March 25-28, 2025, Barcelona, Spain / [ed] S. Roy, A. Kara, Schloss Dagstuhl- Leibniz-Zentrum fur Informatik GmbH, Dagstuhl Publishing , 2025, Vol. 328, p. 24:1-24:20Conference paper, Published paper (Other academic)
Abstract [en]
Explaining why and how a tree t structurally differs from another tree t* is a question that is encountered throughout computer science, including in understanding tree-structured data such as XML or JSON data. In this article, we explore how to learn explanations for structural differences between pairs of trees from sample data: suppose we are given a set {(t1, t*1), . . ., (tn, t*n)} of pairs of labelled, ordered trees; is there a small set of rules that explains the structural differences between all pairs (ti, t*i)? This raises two research questions: (i) what is a good notion of “rule” in this context?; and (ii) how can sets of rules explaining a data set be learned algorithmically? We explore these questions from the perspective of database theory by (1) introducing a pattern-based specification language for tree transformations; (2) exploring the computational complexity of variants of the above algorithmic problem, e.g. showing NP-hardness for very restricted variants; and (3) discussing how to solve the problem for data from CS education research using SAT solvers.
Place, publisher, year, edition, pages
Schloss Dagstuhl- Leibniz-Zentrum fur Informatik GmbH, Dagstuhl Publishing , 2025. Vol. 328, p. 24:1-24:20
Series
Leibniz International Proceedings in Informatics, ISSN 1868-8969 ; 328
Keywords [en]
computational complexity, learning from positive examples, Tree pattern transformations, Adversarial machine learning, Federated learning, Specification languages, Trees (mathematics), Labeled ordered tree, Learn+, Pattern transformations, Sample data, Set of rules, Structural differences, Tree pattern, Tree pattern transformation, Tree-structured data, Contrastive Learning
National Category
Information Systems
Identifiers
URN: urn:nbn:se:hj:diva-67553DOI: 10.4230/LIPIcs.ICDT.2025.24Scopus ID: 2-s2.0-105001558235ISBN: 978-3-95977-364-5 (print)OAI: oai:DiVA.org:hj-67553DiVA, id: diva2:1952050
Conference
28th International Conference on Database Theory, ICDT 2025 Barcelona 25 March 2025 through 28 March 2025
2025-04-142025-04-142025-04-14Bibliographically approved