-
Article
Use Amazon tools, staff appropriately for big data analytics
Amazon's tools are helpful, but enterprises must also seek out qualified candidates despite a skills gap. Read Now
-
Article
Google's search engine database advances big data
The release of Google Cloud Bigtable, a managed and scalable NoSQL database, added competition for AWS. Read Now
-
Article
Technological look into mining data
Take a thorough look at the different technologies related to big data analytics and how they can help business operations. Read Now
-
Article
Analytics as a service offers infrastructure-free alternative
Big data presents challenges for businesses trying to make use of it. Some enterprises are looking to analytics firms to ease the analytics burden. Read Now
Editor's note
The rise of public cloud has legitimized the data analytics market -- making big data a bigger deal than ever before. Companies that have been collecting terabytes of data for years can now use public cloud as a cost-effective approach to mine and analyze that data. And successful big data analytics strategies often mean a competitive advantage for companies.
Amazon Web Services (AWS) offers a variety of data analytics tools. AWS customers can do everything from process data in real time to implement machine learning for applications. Currently, there are five primary AWS products for cloud-based analytics: Elastic MapReduce (EMR), Kinesis, Redshift, Data Pipeline and Machine Learning.
Third-party tools also exist to diversify and expand on the AWS analytics portfolio. While each service supports big data in its own way, it's key for administrators to understand each offering to ensure proper data integration.
1Accessing, protecting and storing big data
Amazon Web Services makes managing big data easier and more cost effective than ever, with a variety of options to store the petabytes. Amazon's typical slate of products is well equipped for storing big data, including Simple Storage Service (S3) and Elastic Block Store (EBS). But speed is a consideration in data analytics; the faster an enterprise can access its data, the faster it can act on it. Enterprises can access that data more quickly by using a secure NoSQL database, which relies on solid-state hard drives. DynamoDB is a great place to start, though third-party options are available. Amazon Relational Database Service plays a complimentary role to a NoSQL database by offering quick and consistent performance, and is optimized for transactional workloads. Elastic File System can be another useful tool in big data projects, scaling up to handle large flows of data.
-
Article
NoSQL database security drives big data
NoSQL databases handle massive amounts of unstructured data and are growing in demand because of authentication and access controls. Read Now
-
Article
Comparing EFS to other AWS storage options
Like any database option, Elastic File System has its strengths and weaknesses. Learn when to use it for big data and other projects. Read Now
-
Article
Take the Amazon Redshift quiz
Learn more about Amazon's data warehouse service by testing your knowledge with this 10-question quiz. Read Now
2Process big data, and then visualize it
Once you're ready to mine and process data from your databases, there is no shortage of tools to help with that task. In some situations, enterprises need instantaneous information -- such as monetary transactions, social media response and clickstreams. Amazon Kinesis allows users to build a dashboard or application to monitor information as soon as it comes in from the data stream. Kinesis dashboards are one method for visualizing big data, but it might not suit the needs of every business. Third-party options like Tableau offer connectivity to EMR and other AWS products. Being able to see past data and using it to generate predictive algorithms is another challenge. And creating mathematical algorithms to interpret future data can be a tough and time-consuming task. Amazon Machine Learning provides visualization tools and helps create models to react to real-time data.
-
Article
Amazon Elastic MapReduce evolves, supports Apache Spark
Hadoop frameworks are a cornerstone of Elastic MapReduce, but the program is evolving to meet the needs of more customers. Read Now
-
Article
Amazon Kinesis allows for real-time data processing
Kinesis has its strengths and weaknesses. Learn how best to apply the real-time data processing tool to your analytics operation. Read Now
-
Article
When to use Amazon Kinesis -- and when not to use it
Kinesis is a flexible data processing program that can begin in seconds, but that doesn't necessarily mean it's the ideal tool for your enterprise. Read Now
-
Article
AWS Data Pipeline helps manage cloud workflows
Data doesn't serve one solitary purpose. AWS Data Pipeline streamlines data to help identify your cloud workflow. Read Now
-
Article
The benefits and drawbacks of a machine learning service
Machine learning is all about mathematics, and AWS Machine Learning puts the numbers to work for its users. Read Now
-
Podcast
Real-world uses for machine learning
A growing data ecosystem is leading to more businesses using higher levels of compute power. Here's how some of them are using machine learning. Listen Now