PinnedSpark Shuffle and Best Practices for Performance TuningShuffle is the most fundamental process in Spark. Data Shuffling is a process where data is redistributed across different partitions…May 17May 17
PinnedBiotecnologia, Big Data e Aprendizagem ArtificialA biotecnologia e a tecnologia da informação se desenvolveram muito nos últimos anos, esses avanços ocorrem em paralelo. O avanço…Nov 4, 2020Nov 4, 2020
What I’ve learned as a Marketing Data EngineerA brief review about the main marketing APIs and marketing data features that influence standard data pipeline concepts.Nov 6, 2022Nov 6, 2022