Questions
Linux
Laravel
Mysql
Ubuntu
Git
Menu
HTML
CSS
JAVASCRIPT
SQL
PYTHON
PHP
BOOTSTRAP
JAVA
JQUERY
R
React
Kotlin
×
Linux
Laravel
Mysql
Ubuntu
Git
New posts in bigdata
Storing trillions of document similarities
Mar 30, 2019
sql
performance
csv
bigdata
how to fetch all of data from hbase table in spark
Oct 17, 2022
java
mapreduce
hbase
bigdata
apache-spark
How does the Apache Spark scheduler split files into tasks?
May 25, 2022
apache-spark
bigdata
Apache Spark ALS recommendations approach
Apr 03, 2020
apache-spark
machine-learning
bigdata
recommendation-engine
apache-spark-mllib
Spark 2.3 dynamic partitionBy not working on S3 AWS EMR 5.13.0
Nov 15, 2022
scala
apache-spark
amazon-s3
bigdata
amazon-emr
Akka for simulations
Nov 06, 2019
simulation
akka
bigdata
How do I submit a Spark jar to a EMR cluster?
Dec 10, 2019
amazon-web-services
mapreduce
apache-spark
bigdata
emr
R ff package ffsave 'zip' not found
Mar 06, 2022
r
bigdata
ffbase
AWS Glue convert files from JSON to Parquet with same partitions as source table
Sep 19, 2022
amazon-web-services
bigdata
aws-glue
Which data structure to store binary strings and query with hamming distane
May 02, 2018
distance
hamming-distance
bigdata
How does Cassandra store null values?
Oct 02, 2022
cassandra
bigdata
Tips for creating a very large database of hashes
Nov 06, 2022
database
hash
inverted-index
bigdata
Using Twitter Storm to process log data?
Nov 01, 2022
logging
bigdata
apache-storm
Wrapping R's plot function (or ggplot2) to prevent plotting of large data sets
Aug 17, 2022
r
plot
ggplot2
bigdata
Is it possible to run Python's scikit-learn algorithms over Hadoop? [closed]
Oct 22, 2022
python
hadoop
machine-learning
bigdata
scikit-learn
Why does the author proposed the HBase Tall-Thin schema over Short-Wide described inside?
Nov 04, 2022
java
hbase
bigdata
Handling large String lists in java
Sep 28, 2022
java
data-structures
bigdata
hashset
Numpy efficient big matrix multiplication
Jul 24, 2019
python
numpy
matrix
bigdata
pytables
Is it possible to read pdf/audio/video files(unstructured data) using Apache Spark?
May 04, 2022
hadoop
apache-spark
bigdata
Joining a large and a massive spark dataframe
Feb 15, 2022
python
apache-spark
dataframe
pyspark
bigdata
« Newer Entries
Older Entries »