
Rainer Gemulla

Max-Planck-Institut für Informatik
Department 5: Databases and Information Systems
Campus E1 4, Room 404
66123 Saarbrücken
Germany

Email: Get my email address via email
Phone: +49 681 9325 5004
Fax: +49 681 9325 599

I headed the research group on Scalable Management of Uncertain Data at Department 5 of the Max-Planck-Institut für Informatik. In August 2014, I joined the Data and Web Science research group at the University of Mannheim; see my new homepage.


Research Interests




Ph.D. Students




Teaching



     Current semester (FSS 2014):
     Past semesters:

Awards




Publications


2014    P. Roy, J. Teubner, R. Gemulla
Low-Latency Handshake Join
To appear in PVLDB, 2014
L. Qu, Y. Zhang, R. Wang, L. Jiang, R. Gemulla, G. Weikum
Senti-LSSVM: Sentiment-Oriented Multi-Relation Extraction with Latent Structural SVM [pdf]
To appear in TACL, 2014
D. Erdös, R. Gemulla, E. Terzi
Reconstructing Graphs from Neighborhood Data [pdf (author version)]
To appear in TKDD, 2014
2013    F. Makari, C. Teflioudi, R. Gemulla, P. J. Haas, Y. Sismanis
Shared-Memory and Shared-Nothing Stochastic Gradient Descent Algorithms for Matrix Completion [pdf (author version)]
To appear in KAIS (special issue: best papers of ICDM 2012), 2013
F. Makari, R. Gemulla
A Distributed Approximation Algorithm for Mixed Packing-Covering Linear Programs [pdf]
In NIPS 2013 Biglearn workshop (poster), 2013
F. Makari, B. Awerbuch, R. Gemulla, R. Khandekar, J. Mestre, M. Sozio
A Distributed Algorithm for Large-Scale Generalized Matching [pdf]
Note: The analysis of the number of binary search steps (Lemma 2) contains a bug; see our Biglearn paper for a corrected version.
In PVLDB, 6(9), pp. 613-624, 2013
I. Miliaraki, K. Berberich, R. Gemulla, S. Zoupanos
Mind the Gap: Large-Scale Frequent Sequence Mining [pdf, slides, resources]
In SIGMOD, pp. 797-808, 2013
L. Del Corro, R. Gemulla
ClausIE: Clause-Based Open Information Extraction [pdf, slides, resources]
In WWW, pp. 355-366, 2013
R. Gemulla, P. J. Haas, W. Lehner
Non-Uniformity Issues and Workarounds in Bounded-Size Sampling [pdf (author version), pdf (journal version), source code]
In The VLDB Journal, 22(6), pp. 753-772, 2013
K. Beedkar, L. Del Corro, R. Gemulla
Fully Parallel Inference in Markov Logic Networks [pdf]
In BTW, pp. 205-224, 2013
2012    D. Erdös, R. Gemulla, E. Terzi
Reconstructing Graphs from Neighborhood Data [pdf, slides]
In ICDM, pp. 231-240, 2012
C. Teflioudi, F. Makari, R. Gemulla
Distributed Matrix Completion [pdf, slides]
In ICDM, pp. 655-664, 2012
L. Qu, R. Gemulla, G. Weikum
A Weakly Supervised Model for Sentence-Level Semantic Orientation Analysis with Multiple Experts [pdf]
In EMNLP-CoNLL, pp. 149-159, 2012
2011    R. Gemulla, P. J. Haas, Y. Sismanis, C. Teflioudi, F. Makari
Large-Scale Matrix Factorization with Distributed Stochastic Gradient Descent [pdf, slides]
In NIPS 2011 Biglearn workshop, 2011 (best paper award)

R. Gemulla, E. Nijkamp, P. J. Haas, Y. Sismanis
Large-Scale Matrix Factorization with Distributed Stochastic Gradient Descent [pdf, slides]
In KDD, pp. 69-77, 2011

K. Beyer, V. Ercegovac, R. Gemulla, A. Balmin, M. Eltabakh, C.C. Kanne, F. Ozcan, E. Shekita
Jaql: A Scripting Language for Large Scale Semistructured Data Analysis [pdf]
In PVLDB (industrial track), 4(11), pp. 1272-1283, 2011

M. Y. Eltabakh, Y. Tian, F. Özcan, R. Gemulla, A. Krettek, J. McPherson
CoHadoop: Flexible Data Placement and Its Exploitation in Hadoop [pdf]
In PVLDB, 4(9), pp. 575-585, 2011

R. Gemulla, P. J. Haas, E. Nijkamp, Y. Sismanis
Large-Scale Matrix Factorization with Distributed Stochastic Gradient Descent [pdf]
IBM Research Report RJ10481, March 2011 (revised February 2013)

B. Schlegel, R. Gemulla, W. Lehner
Memory-Efficient Frequent-Itemset Mining [pdf]
In EDBT, pp. 461-472, 2011
2010    S. Das, Y. Sismanis, K. S. Beyer, R. Gemulla, P. J. Haas, J. McPherson.
Ricardo: Integrating R and Hadoop [pdf]
In SIGMOD (industrial track), pp. 987-998, 2010

B. Schlegel, R. Gemulla, W. Lehner.
Fast Integer Compression using SIMD Instructions [pdf]
In DAMON, pp. 34-40, 2010
2009    K. Beyer, R. Gemulla, P. J. Haas, B. Reinwald, Y. Sismanis.
Distinct-Value Synopses for Multiset Operations [pdf]
In Commun. ACM, 52(10), pp. 87-95, 2009
Technical perspective by Surajit Chaudhuri.

B. Schlegel, R. Gemulla, W. Lehner.
k-Ary Search on Modern Processors [pdf, slides]
In DAMON, pp. 52-60, 2009
2008    R. Gemulla.
Sampling Algorithms for Evolving Datasets [pdf, summary, slides]
Ph.D. thesis, Technische Universität Dresden, 2009
URL for citations: http://nbn-resolving.de/urn:nbn:de:bsz:14-ds-1224861856184-11644

R. Gemulla, P. Rösch and W. Lehner.
Linked Bernoulli Synopses: Sampling Along Foreign Keys [pdf, slides]
In SSDBM, pp. 6-23, 2008

R. Gemulla and W. Lehner.
Sampling Time-Based Sliding Windows in Bounded Space [pdf, slides]
In SIGMOD, pp. 379-392, 2008

P. Rösch, R. Gemulla and W. Lehner.
Designing Random Sample Synopses with Outliers [pdf, poster]
In ICDE (poster), pp. 1400-1402, 2008
2007    R. Gemulla, W. Lehner and P. J. Haas.
Maintaining Bounded-Size Sample Synopses of Evolving Datasets [pdf]
In The VLDB Journal, Special Issue: Best Papers of VLDB 2006, pp. 173-201, 2007
Note: The resizing algorithm proposed in this article contains a bug; see my Ph.D. thesis or our 2013 VLDB Journal paper for a corrected version.

K. Beyer, P. J. Haas, B. Reinwald, Y. Sismanis and R. Gemulla.
On Synopses for Distinct-Value Estimation Under Multiset Operations [pdf, slides]
In SIGMOD, pp. 199-210, 2007

R. Gemulla, W. Lehner and P. J. Haas.
Maintaining Bernoulli Samples over Evolving Multisets [pdf, slides]
In PODS, pp. 93-102, 2007
2006    R. Gemulla, W. Lehner and P. J. Haas.
A Dip in the Reservoir: Maintaining Sample Synopses of Evolving Datasets [pdf, slides]
In VLDB, pp. 595-606, 2006

A. Klein, R. Gemulla, P. Rösch and W. Lehner.
Derby/S: A DBMS for Sample-Based Query Answering [pdf, poster1, poster2]
In SIGMOD (demo), pp. 757-759, 2006

R. Gemulla and W. Lehner.
Deferred Maintenance of Disk-Based Random Samples [pdf, slides]
In EDBT, pp. 423-441, 2006

Other