Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Databricks photon vs catalyst Optimizer

I was reading about the new databricks feature of delta engine which is powered by photon. From the understanding i have by reading i have a feeling that its similar to catalyst optimizer. Will the photon replace the catalyst.

How is the photon engine different to catalyst optimizer

The link is referred below

https://databricks.com/blog/2020/06/24/introducing-delta-engine.html

like image 951
Snehasish Das Avatar asked Nov 25 '25 15:11

Snehasish Das


1 Answers

I think you are conflating two things:

  • Catalyst optimizer is about coming up "Steps to take to execute the query". For example, the optimizer will decide how and when to do the join, aggregations, filters etc. When it should do what. This is also called "Physical Plan" in technical parlance. (Catalyst optimizer is more for logical planning but that is a finer detail)
  • The execution engine is actually the engine that carries out the steps decided by the optimizer. It does not second guess the steps, but rather makes sure to carry them out as efficiently as possible.

Delta Engine (or Photon) is an execution engine and not an optimizer. It is more directly comparable with the Tungsten (whole stage code generation) execution engine currently available in open source Spark. Both Tungsten and Delta-Engine are execution engines.

like image 59
Devesh Agrawal Avatar answered Nov 27 '25 03:11

Devesh Agrawal



Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!