Questions
Linux
Laravel
Mysql
Ubuntu
Git
Menu
HTML
CSS
JAVASCRIPT
SQL
PYTHON
PHP
BOOTSTRAP
JAVA
JQUERY
R
React
Kotlin
×
Linux
Laravel
Mysql
Ubuntu
Git
New posts in bigdata
Parquet predicate pushdown
Sep 12, 2022
hadoop
apache-spark
parquet
bigdata
Is Data Lake and Big Data the same?
Jul 27, 2022
bigdata
data-lake
Apache Hadoop vs Google Bigdata
Oct 29, 2022
hadoop
comparison
hdfs
bigdata
gfs
Mini batch-training of a scikit-learn classifier where I provide the mini batches
Sep 05, 2022
python
scikit-learn
bigdata
NumPy reading file with filtering lines on the fly
Oct 25, 2022
python
input
numpy
large-files
bigdata
How to do a join in Elasticsearch -- or at the Lucene level
Sep 10, 2022
join
lucene
nosql
elasticsearch
bigdata
pyspark: counter part of like() method in dataframe
Aug 13, 2022
apache-spark
spark-dataframe
pyspark-sql
bigdata
Can large datasets be used with Excel 2013? [closed]
Aug 31, 2022
excel
bigdata
excel-2013
What do I need to know about working with huge databases?
Oct 27, 2022
sql
database
database-design
bigdata
Extend numpy mask by n cells to the right for each bad value, efficiently
Oct 07, 2022
python
numpy
bigdata
It appears I've run out of 32-bit address space. What are my options?
Sep 15, 2019
python
numpy
bigdata
Apache Spark: impact of repartitioning, sorting and caching on a join
Nov 04, 2022
apache-spark
pyspark
bigdata
azure-databricks
delta-lake
Processing a very large text file with lazy Texts and ByteStrings
Mar 14, 2021
haskell
text
hashmap
bigdata
file-processing
Send KafkaProducer from local machine to hortonworks sandbox on virtualbox
Jul 20, 2020
hadoop
bigdata
apache-kafka
hortonworks-data-platform
Implementing custom Spark RDD in Java
Mar 15, 2022
apache-spark
bigdata
Spark Scala Understanding reduceByKey(_ + _)
Oct 14, 2022
scala
apache-spark
word-count
bigdata
How to process a range of hbase rows using spark?
Apr 01, 2022
java
hadoop
bigdata
apache-spark
Pyspark: how to duplicate a row n time in dataframe?
Sep 10, 2022
python
pyspark
bigdata
In spark join, does table order matter like in pig?
Oct 16, 2022
hadoop
apache-spark
apache-pig
bigdata
Creating a comparable and flexible fingerprint of an object
Feb 20, 2021
c#
sql
algorithm
data-mining
bigdata
« Newer Entries
Older Entries »