zur Startseite

Publications

To get an overview of my publications, refer to the list below, to my DBLP entry or Google Scholar's result.


2019

LuPe: A System for Personalized and Transparent Data-driven Decisions
Sarah Oppold and Melanie Herschel
International Conference on Information and Knowledge Management (CIKM), Beijing, China, 2019

Towards Integrating Collaborative Filtering in Visual Data Exploration Systems
Houssem Ben Lahmar and Melanie Herschel
European Conference on Advances in Databases and Information Systems, Bled, Slovenia, 2019

Capturing and querying structural provenance in Spark with Pebble
Ralf Diestankämper, Melanie Herschel
ACM SIG Conference on the Management of Data (SIGMOD), Amsterdam, The Netherlands, 2019

Volume-based large dynamic graph analysis supported by evolution provenance
Valentin Bruder, Houssem Ben Lahmar, Marcel Hlawatsch, Steffen Frey, Michael Burch, Daniel Weiskopf, Melanie Herschel, Thomas Ertl
Multimedia Tools and Applications. 10.1007/s11042-019-07878-6, 2019

Towards task-based parallelization for entity resolution
Leonardo Gazzarri, Melanie Herschel
Special Issue of the Springer Journal Software-Intensive Cyber-Physical Systems (SICS)

Query-based Why-not Explanations for Nested Data
Ralf Diestelkämper, Boris Glavic, Melanie Herschel, Seokki Lee
Workshop on Theory and Practice of Provenance (TaPP), Philadelphia, PA, USA

Structural summaries for visual provenance analysis
Houssem Ben Lahmar, Melanie Herschel
Workshop on Theory and Practice of Provenance (TaPP), Philadelphia, PA, USA

Advances in Database Technology - 22nd International Conference on Extending Database Technology, EDBT 2019, Lisbon, Portugal, March 26-29, 2019, Proceedings
Melanie Herschel, Helena Galhardas, Berthold Reinwald, Irini Fundulaki, Carsten Binnig, Zoi Kaoudi
OpenProceedings.org 2019, ISBN 978-3-89318-081-3

2018

Simultaneous Visual Analysis of Multiple Software Hierarchies
Christoph Schulz, Adrian Zeyfang, Mereke van Garderen, Houssem Ben Lahmar, Melanie Herschel, Daniel Weiskopf
IEEE Working Conference on Software Visualization (VISSOFT), 2018.

Provenance for Entity Resolution
Sarah Oppold and Melanie Herschel
International Provenance and Annotation Workshop (IPAW), Provenance Week 2018, London, UK, 2018.

Proceedings of the 10th USENIX Workshop on the Theory and Practice of Provenance (TaPP)
Melanie Herschel USENIX Workshop on the Theory and Practice of Provenance (TaPP), Provenance Week 2018, London, UK, 2018.

Provenance-Based Visual Data Exploration with EVLIN
Houssem Ben Lahmar, Melanie Herschel, Michael Blumenschein, Daniel A. Keim
International Conference on Extending Database Technology (EDBT), Vienna, Austria, 2018.

2017

A survey on provenance: What for? What form? What from?
Melanie Herschel, Ralf Diestelkämper, Houssem Ben Lahmar
The VLDB Journal, vol. 26, no. (6), pages 881-906, 2017. A preprint-version is available here. The final publication is available at link.springer.com.

Provenance in DISC Systems: Reducing Space Overhead at Runtime.
Ralf Diestelkämper and Melanie Herschel
USENIX Workshop on the Theory and Practice of Provenance (TAPP), Seattle, USA, 2017.

Provenance-based Recommendations for Visual Data Exploration
Houssem Ben Lahmar and Melanie Herschel
USENIX Workshop on the Theory and Practice of Provenance (TAPP), Seattle, USA, 2017.

Proceedings of the 17th national conference "Datenbanksysteme für Business, Technologie und Web (BTW)"
Bernhard Mitschang, Daniela Nicklas, Frank Leymann, Harald Schöning, Melanie Herschel, Jens Teubner, Theo Härder, Oliver Kopp, Matthias Wieland
17. Fachtagung des GI-Fachbereichs „Datenbanken und Informationssysteme" (DBIS), 2017, Stuttgart, Germany.

2016

Reuse-based Optimization for Pig Latin
Jesus Camacho-Rodriguez, Dario Colazzo, Melanie Herschel, Ioana Manolescu, and Soudip Roy Chowdhury
International Conference on Information and Knowledge Management (CIKM), Indianapolis, USA, 2016.

Provenance: On and Behind the Screens (tutorial, also see website)
Melanie Herschel and Marcel Hlawatsch
International Conference on the Management of Data (SIGMOD), San Francisco, USA, 2016.

Refining SQL Queries based on Why-Not Polynomials
Nicole Bidoit, Melanie Herschel, and Katerina Tzompanaki
USENIX Workshop on the Theory and Practice of Provenance (TAPP), Washington D.C., USA, 2016.

A Tutorial on Instance Matching Benchmarks (tutorial, also see website)
Irini Fundulaki, Melanie Herschel, and Tzanina Saveta
European Semantic Web Conference (ESWC), Heraklion, Greece, 2016.

2015

Immutably answering Why-Not questions for equivalent conjunctive queries
Nicole Bidoit, Melanie Herschel, and Katerina Tzompanaki
Ingénierie des Systèmes d'Information 20(5): 27-52, 2015.

Efficient computation of polynomial explanations of Why-Not questions
Nicole Bidoit, Melanie Herschel, and Katerina Tzompanaki
International Conference on Information and Knowledge Management (CIKM), Melbourne, Australia, 2015.

LANCE: Piercing to the Heart of Instance Matching Tools
Tzanina Saveta, Evangelia Daskalaki, Giorgos Flouris, Irini Fundulaki, Melanie Herschel, and Axel-Cyrille Ngonga Ngomo
International Semantic Web Conference (ISWC), Bethlehem, Pennsylvania, USA, 2015.

EFQ: Why-Not Polynomials in Action
Nicole Bidoit, Melanie Herschel, and Katerina Tzompanaki
Proceedings of the VLDB Endowment (PVLDB), Vol. 8, No. 12, 2015.

Pushing the Limits of Instance Matching Systems: A Semantics-Aware Benchmark for Linked Data
Tzanina Saveta, Evangelia Daskalaki, Giorgos Flouris, Irini Fundulaki, Melanie Herschel, and Axel-Cyrille Ngonga Ngomo
International Conference on World Wide Web (WWW) - Companion Volume, Florence, Italy, 2015.

A Hybrid Approach to Answering Why-Not Questions on Relational Query Results
Melanie Herschel
Journal of Data and Information Quality (JDIQ) - Special Issue on Provenance, Data and Information Quality, vol. 5 num. 3, 2015.

2014

Instance Matching Benchmarks for Linked Data (tutorial, also see website)
Irini Fundulaki, Evangelia Daskalaki, Melanie Herschel, Janina Saveta
International Semantic Web Conference (ISWC), Riva del Garda - Trentino, Italy, 2014.

Immutably Answering Why-Not Questions for Equivalent Conjunctive Queries
Nicole Bidoit, Melanie Herschel, Katerina Tzompanaki
USENIX Workshop on the Theory and Practice of Provenance (TAPP), Cologne, Germany, 2014.

Query-Based Why-Not Provenance with NedExplain
Nicole Bidoit, Melanie Herschel, Katerina Tzompanaki
International Conference on Extending Database Technology (EDBT), Athens, Greece, 2014.

Entity Resolution in the Web of Data (tutorial, also see website)
Kostas Stefanidis, Vasilis Efthymiou, Melanie Herschel and Vassilis Christophides
International World Wide Web Conference (WWW), Seoul, Korea, 2014.

2013

Wondering why data are missing from query results? Ask Conseil Why-Not
Melanie Herschel
International Conference on Information and Knowledge Management (CIKM), San Francisco, USA, 2013.

Entity Resolution in the Web of Data (tutorial)
Kostas Stefanidis, Vasilis Efthymiou, Melanie Herschel, Vassilis Christophides
International Conference on Information and Knowledge Management (CIKM), San Francisco, USA, 2013.

Answering Why-Not Questions
Nicole Bidoit, Melanie Herschel, Katerina Tzompanaki
Bases de Données Avancées (BDA), Nantes, France, 2013.

Entity Resolution (tutorial)
Melanie Herschel
International Workshop on Open Data (WOD), Paris, France, 2013.

Efficient and Effective Duplicate Detection in Hierarchical Data
Luís Leitão, Pável Calado, and Melanie Herschel
IEEE Transactions on Knowledge and Data Engineering (TKDE), vol. 25, num. 5, pages 1028 - 1041, 2013.


2012

Scalable Iterative Graph Duplicate Detection
Melanie Herschel, Felix Naumann, Sascha Szott, Maik Taubert
IEEE Transactions on Knowledge and Data Engineering (TKDE) vol. 24, num. 11, pages 2094-2108, 2012.

The Nautilus Analyzer: Understanding and Debugging Data Transformations
Melanie Herschel and Hanno Eichelberger
International Conference on Information and Knowledge Management (CIKM), Maui, USA, 2012.

Data Bridges: Data Integration for Digital Cities
Melanie Herschel and Ioana Manolescu
CIKM 2012 City Data Management Workshop (CDMW), Maui, USA, 2012.

Application de mesures de distance pour la détection de problèmes de qualité de données.
Melanie Herschel and Laure Berti
Chapter In La qualité et la gouvernance de données au service de la performance des entreprises, Ed. Hermes, 09/2012.

Data Integration
Melanie Herschel
it - Information Technology, Vol. 54, No. 3, 2012.


2011

Eliminating NULLs with Subsumption and Complementation
Jens Bleiholder, Melanie Herschel, and Felix Naumann
Bulletin of the IEEE Computer Society Technical Committee on Data Engineering, September 2011, Vol. 34, No. 3., 2011.

Transformation Lifecycle Management with Nautilus
Melanie Herschel, Torsten Grust
VLDB 2011 Workshop on Quality in Databases (QDB). Seattle, USA, 2011.

Proceedings of the First International Workshop on Managing Data Throughout its Lifecycle (DaLi)
Carlo. A. Curino, Melanie Herschel, Paolo Papotti
Collocated with ICDE 2011, Hannover, Germany.


2010

Explaining Missing Answers to SPJUA Queries
Melanie Herschel, Mauricio A. Hernández
Proceedings of the VLDB Endowment, Vol. 3, No. 1, 2010.

An Overview of XML Duplicate Detection Algorithms
Pável Calado, Melanie Herschel, and Luís Leitão
Chapter in Soft Computing in XML Data Management, Studies in Fuzziness and Soft Computing, Vol. 255. Springer, 2010.

An Introduction to Duplicate Detection
Felix Naumann, Melanie Herschel
Synthesis Lectures on Data Management, Morgan and Claypool, 2010.

Subsumption and Complementation as Data Fusion Operators
Jens Bleiholder, Sascha Szott, Melanie Herschel, Frank Kaufer, Felix Naumann
International Conference on Extending Database Technology (EDBT), Lausanne, Switzerland, 2010.

Complement Union for Data Integration
Jens Bleiholder, Sascha Szott, Melanie Herschel, Felix Naumann
ICDE 2010 Workshop on New Trends in Information Integration (NTII), Long Beach, USA, 2010.


2009

Artemis: A System for Analyzing Missing Answers
Melanie Herschel, Mauricio A. Hernandez, Wang Chiew Tan
In Proceedings of the VLDB Endowment, Vol. 2, 2009.

Dublettenerkennung unter Berücksichtigung von Datenabhängigkeiten (Duplicate Detection Exploiting Data Relationships)
Melanie Herschel
it - Information Technology Vol. 51, No. 4, 2009.

Proceedings of the 7th International Workshop on the Quality of Data (QDB 2009)
Laure Berti-Equille, Melanie Herschel, and Ahmed K. Elmagarmid
Collocated with VLDB 2009, Lyon, France.


2008

Scaling up duplicate detection in graph data
Melanie Herschel, Felix Naumann.
International Conference on Information and Knowledge Management (CIKM), Napa, USA, 2008.

Duplicate Detection in XML (PhD thesis)
Melanie Herschel
Wiku-Verlag - Verlag für Wissenschaft und Kultur, 2008.

Industry-scale duplicate detection
Melanie Weis, Felix Naumann, Ulrich Jehle, Jens Lufter, Holger Schuster
Proceedings of the VLDB Endowment, Vol. 1, No. 2, 2008.

Dublettenerkennung in komplex strukturierten Daten.
Melanie Weis
In Dorothea Wagner et al., editor, Ausgezeichnete Informatikdissertationen 2007.
GI-Edition Lecture Notes in Informatics (LNI), Bonner Köllen Verlag, 2008.

Space and Time Scalability of Duplicate Detection in Graph Data.
Melanie Weis and Felix Naumann
Technical report 25, Hasso-Plattner-Institut, 2008.


2007

Structure-Based Inference of XML Similarity for Fuzzy Duplicate Detection
Luis Leitao, Pavel Calado, and Melanie Weis
International Conference on Information and Knowledge Management (CIKM), Lisboa, Portugal, 2007.

Declarative XML Data Cleaning with XClean
Melanie Weis and Ioana Manolescu
International Conference on Advanced Information Systems Engineering (CAISE), Trondheim, Norway, 2007.

XClean in Action (demo)
Melanie Weis and Ioana Manolescu
In Conference on Innovative Databasz Research (CIDR) , Asilomar, USA, 2007.


2006

XML Duplicate Detection Using Sorted Neighborhoods
Sven Puhlmann, Melanie Weis and Felix Naumann
Conference on Extending Database Technology (EDBT), Munich, Germany, 2006.

A Duplicate Detection Benchmark for XML (and Relational) Data
Melanie Weis, Felix Naumann and Franziska Brosy
SIGMOD 2006 Workshop on Information Quality for Information Systems (IQIS), Chicago, IL, 2006.

XStruct: Efficient Schema Extraction from Multiple and Large XML Documents
Jan Hegewald, Felix Naumann and Melanie Weis
ICDE 2006 Workshop on XML Schema and Data Management (XSDM), Atlanta, Georgia, 2006.

Detecting Duplicates in Complex XML Data (poster)
Melanie Weis and Felix Naumann
International Conference on Data Engineering (ICDE), Atlanta, Georgia, 2006.

Data Fusion in Three Steps: Resolving Schema, Tuple, and Value Inconsistencies
Felix Naumann, Alexander Bilke, Jens Bleiholder, Melanie Weis
Bulletin of the Technical Committee on Data Engineering, Vol. 29 No. 2, June 2006.

Relationship-Based Duplicate Detection.
Melanie Weis and Felix Naumann
Technical Report No. HU-IB-206, July 2006.


2005

DogmatiX Tracks down Duplicates in XML
Melanie Weis and Felix Naumann.
International Conference on the Management of Data (SIGMOD), Baltimore, MD, 2005.

Fuzzy Duplicate Detection on XML Data
Melanie Weis.
VLDB 2005 PhD Workshop, Trondheim, Norway, 2005.

Automatic Data Fusion with HumMer (demo)
Alexander Bilke, Jens Bleiholder, Christoph Böhm, Karsten Draba, Felix Naumann, Melanie Weis.
International Conference on Very Large Database (VLDB), Troindheim, Norway, 2005.

Erkennen und Bereinigen von Datenfehlern in naturwissenschaftlichen Daten (german)
Heiko Müller, Melanie Weis, Jens Bleiholder and Ulf Leser.
Datenbank-Spektrum, Heft 15, November 2005.


2004

Eine Übung zur Vorlesung Informationsintegration (german)
Felix Naumann, Jens Bleiholder, Melanie Weis.
Datenbank-Spektrum Heft 11, November 2004.

Detecting Duplicate Objects in XML Documents
Melanie Weis and Felix Naumann.
SIGMOD 2004 Workshop on Information Quality for Information Systems (IQIS) , Paris, France, 2004.