Fragen aus Vorstellungsgesprächen für data scientist, von Bewerbern geteilt
The first round you will asked the following question. 1. Explain about the tera sort that happens between the map and reduce phase in hadoop job. 2. Explain about class loader, can you have more than one class loader in jvm. 3. Explain how indexing in implemented in sql databases, be prepared to take about b-tree and tree balancing algorithm. 4. How does java guarantee the hashmap look up in O(1). Round 2 1. You will asked to write java streaming code to parse infinite input while keeping memory low the output should predefined length of random sample.