Channel: security – Cloudera Engineering Blog

How-to: Prepare Unstructured Data in Impala for Analysis

September 17, 2015, 8:27 am

Learn how to build an Impala table around data that comes from non-Impala, or even non-SQL, sources.

As data pipelines grow to include NoSQL stores or loosely specified schemas, you might encounter data files (particularly in Apache Parquet format) whose precise table definition you do not know. This tutorial shows how you can build an Impala table around data that comes from non-Impala or even non-SQL sources.
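
As a rough illustration of the idea, Impala can derive column definitions from an existing Parquet data file with `CREATE EXTERNAL TABLE ... LIKE PARQUET`. The minimal sketch below only builds that statement as a string. The helper name `build_like_parquet_ddl`, the table name, and both HDFS paths are hypothetical placeholders, not taken from the tutorial, and the resulting statement would still need to be run through `impala-shell` or another client.

```python
def build_like_parquet_ddl(table: str, sample_file: str, data_dir: str) -> str:
    """Build Impala DDL that infers a table's columns from one Parquet file.

    table       -- name of the table to create (hypothetical)
    sample_file -- HDFS path of one Parquet file whose schema Impala reads
    data_dir    -- HDFS directory that holds all the data files
    """
    # Reject single quotes so the paths cannot break out of the SQL literals.
    for path in (sample_file, data_dir):
        if "'" in path:
            raise ValueError(f"path must not contain a single quote: {path!r}")

    # EXTERNAL leaves the files in place: dropping the table later
    # removes only the metadata, not the data another system produced.
    return (
        f"CREATE EXTERNAL TABLE {table}\n"
        f"  LIKE PARQUET '{sample_file}'\n"
        f"  STORED AS PARQUET\n"
        f"  LOCATION '{data_dir}';"
    )


if __name__ == "__main__":
    # All names and paths here are assumptions made up for the example.
    print(build_like_parquet_ddl(
        "unknown_data",
        "/user/etl/unknown_data/part-00000.parq",
        "/user/etl/unknown_data",
    ))
```

After running the generated statement, a `DESCRIBE unknown_data;` in Impala would show the column names and types that were inferred from the sample file.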

The post How-to: Prepare Unstructured Data in Impala for Analysis appeared first on Cloudera Engineering Blog.
