Questions Linux Laravel Mysql Ubuntu Git Menu

HTML CSS JAVASCRIPT SQL PYTHON PHP BOOTSTRAP JAVA JQUERY R React Kotlin

New posts in bigdata

Spark, delta lake auto schema evolution for nested columns

Feb 03, 2026

apache-spark pyspark bigdata delta-lake

Transforming one row into many rows using Amazon Glue

Feb 02, 2026

apache-spark pyspark bigdata aws-glue

Pentaho Data Integration (PDI) 9.4 Marketplace missing, how to install Plugin now?

Jan 28, 2026

plugins bigdata pentaho pentaho-spoon pentaho-data-integration

What is the difference between the hive metastore in derby vs the one in hive/warehouse?

Jan 28, 2026

hadoop hive bigdata

How to train a Keras model with very a big dataset?

Jan 28, 2026

python keras bigdata autoencoder unsupervised-learning

Matching many files against many patterns in Java

Jan 22, 2026

java string algorithm pattern-matching bigdata

Hadoop: How to collect output of Reduce into a Java HashMap

Jan 22, 2026

hadoop mapreduce bigdata similarity cascading

Sqoop import job fails due to task timeout

Jan 20, 2026

hadoop bigdata sqoop

Neo4j's MERGE command on big datasets

Jan 03, 2026

merge neo4j bigdata nodes graph-databases

Data Modelling for Big Data

Jan 02, 2026

graph hive google-bigquery arangodb bigdata

Plot subplots from a very large file in gnuplot

Jan 01, 2026

plot gnuplot bigdata

What is the ideal format to store large results generated by R?

Dec 31, 2025

r bigdata mclapply

Read JSON files from multiple line file in spark scala

Dec 30, 2025

json scala apache-spark bigdata

Calculating unique URLs in a huge dataset (150+ billions)

Dec 23, 2025

java bigdata

Hive - Out of Memory Exception - Java Heap Space

Dec 23, 2025

hadoop hive bigdata

Connect to Spark running on VM

Dec 23, 2025

apache-spark virtualbox bigdata

« Newer Entries Older Entries »