pandas on spark apply_batch/transform_batch broken? (tl;dr; No – but it isn’t well documented)

12 · Arnon Rotem-Gal-Oz · Oct. 16, 2022, 7:36 a.m.
Using pypark’s pandas integration via apply_batch and transform_batch is very powerful but lacking documentation can cause hard to trace bugs – hopefully my experience (below)… The post pandas on spark apply_batch/transform_batch broken? (tl;dr; No – but it isn’t well documented) appeared first on Cirrus Minor....