Frage im Vorstellungsgespräch bei Dataiku

How to sort a large dataset within constrained memory limits

Antwort im Vorstellungsgespräch

Anonym

2. Aug. 2025

I proposed and discussed an external merge sort approach: - Split the input file into smaller chunks that fit into memory - Sort each chunk individually and store them in temporary files - Merge the sorted chunks to produce the final output