Realtime Processing With Apache Spark - Study Mode

[#56] Spark architecture is . . . . . . . . times as fast as Hadoop disk-based Apache Mahout and even scales better than Vowpal Wabbit.
Correct Answer

(A) 10

[#57] Sally in data processing uses . . . . . . . . to cleanse and prepare the data.
Correct Answer

(A) Pig

[#58] Groom servers starts up with a . . . . . . . . instance and an RPC proxy to contact the bsp master.
Correct Answer

(B) BSPPeer

[#59] . . . . . . . . is used with Pig scripts to write data to HCatalog-managed tables.
Correct Answer

(C) HCatStorer

[#60] Mahout provides an implementation of a . . . . . . . . identification algorithm which scores collocations using log-likelihood ratio.
Correct Answer

(A) collocation