Much of the data in a typical Hadoop cluster is text data, so enterprises will want to take advantage of it to gain a more complete picture of what's happening with their customers and with operations.
Seattle, WA (PRWEB) March 12, 2014
TDWI Research has released its newest Checklist Report, Utilizing Big Data Analytics with Hadoop. The report examines how enterprises can gain competitive advantage using advanced analytics and how several technologies are coming together to form the fabric of an analytics ecosystem.
Increasingly, companies are dealing with larger amounts of diverse data, some generated by newer sources (such as smartphones) and much of it unstructured (such as machine-generated data). In addition, as big data gets bigger, companies are looking at technologies that can handle the deluge, including Hadoop, an inexpensive solution for storing and processing big data. “[Hadoop] is rapidly becoming an important part of the big data ecosystem,” writes the report’s author, Fern Halper.
“Advances in analytics algorithms and analytics processing have also helped organizations cope. Visualization has helped companies explore data to discover insights—even with big data,” Halper explains. “Analytics algorithms such as machine learning and predictive analytics have matured to support the distributed processing needed for big data analytics.” Text analytics is also helping enterprises derive new meaning from unstructured data.
Halper begins the report by discussing the fundamentals of Hadoop and provides an overview of the collection of tools that exploit its power. She also explains key technologies that accelerate processing and return answers more quickly to business users, such as in-memory analytics.
The Checklist Report explores the role of ETL (extract, transform, and load) and data preparation to enable big data analysis, techniques for enhancing big data exploration and insight discovery (such as visualization techniques and descriptive statistics), and advances in analytics (including text analytics and other data mining techniques).
“Our report helps enterprises understand how text data fits into the analytics mix—including e-mail messages, call center notes, tweets, and blogs,” says Halper. “Much of the data in a typical Hadoop cluster is text data, so enterprises will want to take advantage of it to gain a more complete picture of what is happening with their customers and with operations.” Of course, these new technologies will take a new skill set; Halper looks at what skills data scientists will need to master to take advantage of big data.
This research was sponsored by SAS.
For a complete copy of the report or to ask questions of the author, members of the press should contact Fern Halper at fhalper(at)tdwi(dot)org.
The report is freely downloadable by the public at http://tdwi.org/research/2014/03/Checklist-Utilizing-Big-Data-Analytics-with-Hadoop; a short registration is required for those downloading a TDWI report for the first time.
About the Author
Fern Halper is director of TDWI Research for advanced analytics, focusing on predictive analytics, social media analysis, text analytics, cloud computing, and other “big data” analytics approaches. She has more than 20 years of experience in data and business analysis, and has published numerous articles on data mining and information technology. Halper is co-author of "Dummies" books on cloud computing, hybrid cloud, service-oriented architecture, and service management, and big data. She has been a partner at industry analyst firm Hurwitz & Associates and a lead analyst for Bell Labs. Her Ph.D. is from Texas A&M University. You can reach her at fhalper(at)tdwi(dot)org, or follow her on Twitter: @fhalper.
TDWI, a division of 1105 Media, Inc., is the premier provider of in-depth, high-quality education and research in the business intelligence and data warehousing industry. TDWI is dedicated to educating business and information technology professionals about the best practices, strategies, techniques, and tools required to successfully design, build, maintain, and enhance business intelligence and data warehousing solutions. TDWI also fosters the advancement of business intelligence and data warehousing research and contributes to knowledge transfer and the professional development of its members. TDWI offers a worldwide membership program, five major educational conferences, topical educational seminars, role-based training, on-site courses, certiﬁcation, solution provider partnerships, an awards program for best practices, live Webinars, resourceful publications, and an in-depth research program. For more information, visit tdwi.org or follow us on Twitter @TDWI.
About 1105 Media
1105 Media, Inc., is a leading provider of integrated information and media in targeted business-to-business markets, including specialized sectors of the information technology community; industrial health, safety, and compliance; security; environmental protection; and home healthcare. 1105's offerings span print and online magazines, journals, and newsletters; seminars, conferences, and trade shows; training courseware; and Web-based services. 1105 Media is based in Chatsworth, CA, with offices throughout the United States.
Fern Halper, TDWI