Posted by on
Categories: Hadoop

SAN JOSE, CA–(Marketwired – July 05, 2016) – #WaterlineData, The Smart Data Catalog Company, today announced it was cited in Gartner’s June 2016 Comparing Four Hadoop Integration Architectures. According to Gartner Research Director Mei Yang Selvage, ” #Hadoop is the backbone for big data analytics, but integration remains a key challenge to its success. Integration specialists, architects and IT managers should evaluate four architectures-batch, data virtualization, replication and messaging-to reap the maximal return for your Hadoop.” In addition, the report states, “Managing metadata in Hadoop is no longer optional because the sizes of data lakes are growing rapidly. Metadata is descriptive or structural information about stored data. Gartner predicts that 90% of deployed data lakes will be useless because they are overwhelmed with captured information. To navigate massive information, metadata management is the key to turning information into actionable insights.” Waterline Data was the only purpose-built, Hadoop-specific metadata tool cited in the report. Waterline Data allows organizations to replace manual tagging of metadata with an automated process that rapidly classifies the data assets in their data lake, including new data even as it’s created. With the company’s Smart Data Catalog, data engineers, data scientists and business analysts get secure self-service access to trusted, high quality data for faster discovery, understanding, use and governance. Unlike catalogs that scan historical SQL logs, Waterline Data automatically catalogs every field of data in the data lake while capturing and learning from tribal knowledge.

Leave a Reply

This site uses Akismet to reduce spam. Learn how your comment data is processed.