Department of Computer Science
Jiaheng Lu's homepage
Jiaheng Lu

Associate Professor

Group leader of UDBMS

Department of Computer Science

University of Helsinki

Email :

Office : Exactum C211

[Suomi] [中文]

Short biography

Research goal: Improving the performance and usability of databases systems

I am a computer scientist and a teacher, with a research interest in databases and data management. My recent topics include multi-model database management systems, semantic string processing and job optimization for big data platform.

I was awarded Ph.D. degree in 2007 from the National University of Singapore. My PhD topic was about XML query processing. I did two years Postdoc research at the University of California, Irvine. Then I joined the Renmin University of China in 2008, where I have worked for seven years. I am now working at the University of Helsinki, Finland. I have the broad research and teaching experiences in four countries (China, Singapore, USA, and Finland).


One of my books on Hadoop is awarded as one of the Top 10 Bestselling IT Books in China.



  1. I gave a research talk in Oracle, California "A Categorical Framework on Multi-Model Databases" [PDF](7.9.2019).
  2. We will give a new tutorial in CIKM 2019! "Synergy of Database Techniques and Machine Learning Models for String Similarity Search and Join" [Details](7.9.2019).
  3. Two papers are accepted in VLDB 2019! One research paper: "Towards a Unified Framework for String Similarity Joins" [PDF], [Source code] , [Slides] and one demo paper: "PivotE: Revealing and Visualizing the Underlying EntityStructures for Exploration" [PDF] (22.6.2019).
  4. We will give a new tutorial on autonomas performance tuning in VLDB 2019! See more information here (24.4.2019).
  5. One new survey paper (38 pages) on multi-model databases (to appear in ACM Computing Surveys)! [PDF] (6.3.2019).
  6. A new Postdoc Researcher Dr. Qingsong Guo joined our research group in Helsinki on 14.1.2019. Welcome Qingsong! (22.1.2019).
  7. Congratulate PhD student: Dr. Zhaoan Dong (in Renmin University of China) successfully defended his PhD thesis! Title: A Study of Crowdsourcing-Based Knowledge Acquisition. (12.12.2018).
  8. A new Postdoc Researcher Dr. Lizhen Fu joined our research group in Helsinki on 15.11.2018. Welcome Lizhen! (16.11.2018).
  9. We will give a tutorial on CIKM 2018 for Multi-model Databases and Tightly Integrated Polystores. See Slides, More information. (09.09.2018)
  10. We published three papers on the vision and benchmark for multi-model databases: Vision 1, Vision 2, Benchmark (05.08.2018).
  11. More news ...

Research Topics

Selected papers:

  1. Jiaheng Lu, Irena Holubova : Multi-model Databases: A New Journey to Handle the Variety of Data, ACM Computing Surveys 2019 [PDF]
  2. Jiaheng Lu, Irena Holubova, Bogdan Cautis: Multi-model Databases and Tightly Integrated Polystores CIKM 2018 Tutorial[PDF]
  3. Jiaheng Lu: Towards Benchmarking Multi-Model Databases(Abstract) CIDR 2017[PDF]
  4. Jiaheng Lu, Irena Holubova: Multi-model Data Management: What's New and What's Next? EDBT 2017 Tutorial [PDF][slides]
  5. Chao Zhang, Jiaheng Lu, Pengfei Xu, Yuxing Chen: UniBench: A Benchmark for Multi-model Database Management Systems. TPCTC 2018: 7-23 [PDF]

Selected papers:

  1. Pengfei Xu, Jiaheng Lu: Towards a Unified Framework for String Similarity Joins. PVLDB 12(12) 2019: [PDF], [Slides], [Source Codes]
  2. Pengfei Xu, Jiaheng Lu: Top-k String Auto-Completion with Synonyms. DASFAA (2) 2017: 202-218 [PDF, Slides, Source Codes]
  3. Jiaheng Lu, Chunbin Lin, Wei Wang, Chen Li, Xiaokui Xiao: Boosting the Quality of Approximate String Matching by Synonyms. ACM Trans. Database Syst. 40(3): 15 (2015) [PDF]
  4. Jiaheng Lu, Chunbin Lin, Wei Wang, Chen Li, Haiyong Wang: String similarity measures and joins with synonyms. SIGMOD Conference 2013: 373-384 [PDF]
More research topics and papers ...

Codes and dataset release


PhD students

  1. Gongsheng Yuan (2017-)
  2. Yuxing Chen (2017-)
  3. Pengfei Xu (2016-)
  4. Chao Zhang (2015-)
  5. Yu Liu (RenminU niversity of China) (2014-2018) (Co-supervised with Prof. Zhewei Wei)
  6. Juwei Shi (Renmin University of China) (2013-2018)
  7. Zhaoan Dong (Renmin University of China) (2013-2018)
  8. (Co-supervised with Prof. Xiaofang Zhou and Prof. Ju Fan)

    Detailed information ...

Academic service

Workshop co-chair:

  1. Workshop co-chair in ER 2018.
  2. Keyword search and data exploratory workshop 2016 with ICDE 2016
  3. Keyword search on structured data (KEYS) workshop with SIGMOD 2012
  4. XML-DM Workshop with WAIM 2010
  5. Cloud-DB workshop with CIKM 2010

Proceeding chair:

  1. IEEE ICDE Conference 2013

Program Committee:

  1. ACM SIGMOD'2010, 2013, 2014, 2015, 2016 Research track
  2. Very Large Database Conference Proceeding PVLDB 2010, 2015, 2017, 2020
  3. IEEE ICDE Conference 2011, 2017, 2019, 2020
  4. ER Conference 2018, 2019
  5. Database Systems for Advanced Applications Conference DASFAA 2010,2012, 2013, 2014
  6. Asia-Pacific Web Conference APWeb 2008, 2009, 2011, 2013, 2014, 2015
  7. Web-age information management Conference WAIM 2014,2015,2016
  8. WAIM-APWEB Conference 2017
  9. Web System Engineering (WISE) Conference 2009
  10. Chinese Conference on Information Retrieval (CCIR) 2015, 2016
  11. Australia Database Conference ADC 2013, 2017, 2018, 2019