What should you do?
 

(@leneaveorval)
Noble Member
Joined: 2 years ago
Posts: 705
Topic starter  

You plan to implement an Azure Data Lake Storage Gen2 container that will contain CSV files. The size of the files will vary based on the number of events that occur per hour.

File sizes range from 4 KB to 5 GB.

You need to ensure that the files stored in the container are optimized for batch processing.

What should you do?

  • A. Compress the files.
  • B. Merge the files.
  • C. Convert the files to JSON.
  • D. Convert the files to Avro.


Suggested Answer: D

Explanation:

Avro is suited to batch processing and is also widely used for streaming.

Note: Avro is a framework developed within Apache's Hadoop project. It is a row-based storage format that is widely used for data serialization. Avro stores its schema in JSON, which makes it easy for any program to read and interpret. The data itself is stored in a binary format, which makes it compact and efficient.
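
To make the note above concrete, here is a minimal sketch of the two ideas it describes: a schema written in JSON, and rows encoded in Avro's compact binary form. It uses only the Python standard library and hand-encodes record bodies (zigzag varints for `long`, length-prefixed UTF-8 for `string`). It does not write a full Avro container file; for real conversions you would more likely use an Avro library. The `Event` schema, its field names, and the sample CSV rows are assumptions made up for illustration, not part of the question.

```python
import csv
import io
import json

# Hypothetical schema for illustration; the field names are assumptions.
SCHEMA_JSON = """
{
  "type": "record",
  "name": "Event",
  "fields": [
    {"name": "event_id", "type": "long"},
    {"name": "event_type", "type": "string"}
  ]
}
"""

# Hypothetical sample data standing in for one of the CSV files.
CSV_TEXT = "event_id,event_type\n1,click\n-2,view\n"


def encode_long(n: int) -> bytes:
    """Avro long: zigzag-map the signed value, then emit a base-128 varint."""
    z = (n << 1) ^ (n >> 63)  # zigzag: small magnitudes become small codes
    out = bytearray()
    while z & ~0x7F:  # more than 7 bits remain
        out.append((z & 0x7F) | 0x80)  # low 7 bits plus continuation flag
        z >>= 7
    out.append(z)
    return bytes(out)


def encode_string(s: str) -> bytes:
    """Avro string: byte length as a long, followed by the UTF-8 bytes."""
    raw = s.encode("utf-8")
    return encode_long(len(raw)) + raw


def encode_record(schema: dict, row: dict) -> bytes:
    """Avro record: field encodings concatenated in schema order."""
    parts = []
    for field in schema["fields"]:
        value = row[field["name"]]
        if field["type"] == "long":
            parts.append(encode_long(int(value)))
        elif field["type"] == "string":
            parts.append(encode_string(value))
        else:
            raise ValueError(f"type not handled in this sketch: {field['type']}")
    return b"".join(parts)


schema = json.loads(SCHEMA_JSON)
rows = list(csv.DictReader(io.StringIO(CSV_TEXT)))
encoded = [encode_record(schema, row) for row in rows]
```

Because field names live in the schema rather than in each record, the encoded rows carry only the values, which is where the compactness mentioned in the note comes from.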

Reference: https://www.adaltas.com/en/2020/07/23/benchmark-study-of-different-file-format/

   