Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

New posts in bigdata

Using Twitter Storm to process log data?

Wrapping R's plot function (or ggplot2) to prevent plotting of large data sets

r plot ggplot2 bigdata

Is it possible to run Python's scikit-learn algorithms over Hadoop? [closed]

Why does the author proposed the HBase Tall-Thin schema over Short-Wide described inside?

java hbase bigdata

Handling large String lists in java

Numpy efficient big matrix multiplication

Is it possible to read pdf/audio/video files(unstructured data) using Apache Spark?

hadoop apache-spark bigdata

Joining a large and a massive spark dataframe

Stream processing architecture

Generating a very large matrix of string combinations using combn() and bigmemory package

r combinatorics bigdata

doing PCA on very large data set in R

r bigdata pca

What is the best way to load huge result set in memory?

c# ado.net bigdata datareader

NumPy: 3-byte, 6-byte types (aka uint24, uint48)

python numpy bigdata

NoSQL or RDBMS for audit data

Is there a good way to avoid memory deep copy or to reduce time spent in multiprocessing?

Social-networking: Hadoop, HBase, Spark over MongoDB or Postgres?

What is the difference between broadcast_address and broadcast_rpc_address in cassandra.yaml?

cassandra bigdata

Getting exception : java.lang.NoSuchMethodError: scala.reflect.api.JavaUniverse.runtimeMirror(Ljava/lang/ClassLoader;) while using data frames

Representation of Large Graph with 100 million nodes in C++

c++ vector graph bigdata

How multiple consumer group consumers work across partition on the same topic in Kafka?

apache-kafka bigdata