Q: 12
Government regulations in your industry mandate that you have to maintain an auditable record of access to certain types of data. Assuming that all expiring logs will be archived correctly, where should you store data that is subject to that mandate?
Options
Discussion
Option A makes sense if the main constraint is not raising costs. Pig sits on top of Hadoop, so you can optimize scripts for responsiveness without spinning up more resources, unlike C. Spark (B) works great but often needs infrastructure tweaks or more powerful nodes, which could mean extra spending. Hard to tell unless you know the exact bottleneck, but for exam wording 'no increased cost', A fits best I think. Thoughts?
Probably A here. If you need better responsiveness with no extra cost, Pig can help you optimize MapReduce jobs without extra infra. Spark (B) feels tempting but usually means more resources or complexity. Pretty sure A's what the question wants, but open to other opinions.
A tbh. Pig can help tweak processing without extra infrastructure spend, which fits the "no added cost" part. Spark might be faster but has migration headaches and could impact cost control. Disagree?
A . Tempting to pick B since Spark's fast, but it usually means more cost or infra changes. This one's about optimizing within current setup, so A fits better even if it's not the most modern choice.
Saw a really similar one in my mock, chose B.
Be respectful. No spam.