Volume 36: Proceedings of the 3rd International Workshop on Big Data, Streams and Heterogeneous Source Mining: Algorithms, Systems, Programming Models and Applications, 24 August 2014, New York, New York, USA


Editors: Wei Fan, Albert Bifet, Qiang Yang, Philip S. Yu



Front Matter


Wei Fan, Albert Bifet, Qiang Yang, Philip S. Yu; PMLR 36:i-ix

Accepted Papers

Parallel Graph Mining with GPUs

Robert Kessl, Nilothpal Talukder, Pranay Anchuri, Mohammed Zaki; PMLR 36:1-16

Gibbs Collapsed Sampling for Latent Dirichlet Allocation on Spark

Zhuolin Qiu, Bin Wu, Bai Wang, Le Yu; PMLR 36:17-28

FAQ: A Framework for Fast Approximate Query Processing on Temporal Data

Udayan Khurana, Srinivasan Parthasarathy, Deepak Turaga; PMLR 36:29-45

Reducing Data Loading Bottleneck with Coarse Feature Vectors for Large Scale Learning

Shingo Takamatsu, Carlos Guestrin; PMLR 36:46-60

A Clustering Algorithm Merging MCMC and EM Methods Using SQL Queries

David Matusevich, Carlos Ordonez; PMLR 36:61-76

A Fast Distributed Stochastic Gradient Descent Algorithm for Matrix Factorization

Fanglin Li, Bin Wu, Liutong Xu, Chuan Shi, Jing Shi; PMLR 36:77-87

The Gamma Operator for Big Data Summarization on an Array DBMS

Carlos Ordonez, Yiqun Zhang, Wellington Cabrera; PMLR 36:88-103

Towards Optimal Execution of Density-based Clustering on Heterogeneous Hardware

Dirk Habich, Stefanie Gahrig, Wolfgang Lehner; PMLR 36:104-119

Scalable Graph Building from Text Data

Thibault Debatty, Pietro Michiardi, Olivier Thonnard, Wim Mees; PMLR 36:120-132

High density-focused uncertainty sampling for active learning over evolving stream data

Dino Ienco, Indrė Žliobaitė, Bernhard Pfahringer; PMLR 36:133-148

iPARAS: Incremental Construction of Parameter Space for Online Association Mining

Xiao Qin, Ramoza Ahsan, Xika Lin, Elke Rundensteiner, Matthew Ward; PMLR 36:149-165

Frequent Subgraph Discovery in Large Attributed Streaming Graphs

Abhik Ray, Larry Holder, Sutanay Choudhury; PMLR 36:166-181

From Tweets to Stories: Using Stream-Dashboard to weave the twitter data stream into dynamic cluster models

Basheer Hawwash, Olfa Nasraoui; PMLR 36:182-197

Ensembles of Adaptive Model Rules from High-Speed Data Streams

João Duarte, João Gama; PMLR 36:198-213

Scalable Heterogeneous Transfer Ranking

Mohammad Taha Bahadori, Yi Chang, Bo Long, Yan Liu; PMLR 36:214-228

subscribe via RSS