Difficulty: beginner
Estimated Time: 15 minutes

The MapR Data Platform integrates Apache Hadoop, Apache Spark, and Apache Drill with real-time database capabilities, global event streaming, and scalable enterprise storage to power a new generation of Big Data applications. MapR solves the challenges of complex data environments by managing data and its ecosystem across multiple clouds and containerized infrastructures.

In this scenario you will become familiar with the MapR data platform by interacting with a single-node MapR cluster.

In this scenario you saw how MapR combines Hadoop, Spark, and Apache Drill with a distributed file system, distributed database, and distributed event streaming, all on a single cluster. This improves performance and lowers hardware costs for Big Data applications. The MapR Data Platform allows you to manage your data with any tooling on any infrastructure.

Would you like to learn more about MapR? Check out our blog, In Search of a Data Platform.

If you'd like to speak with MapR, contact us!

Introduction to MapR

Step 1 of 5

Step 1 - Login

The MapR data platform is a single system that allows you to access data using three distinct ways. You can store data in conventional files, as message streams or as noSQL tables. All of your data is arranged in directories and sub-directories just like in a conventional file system and all are subject to consistent controls on who can see, access or update what.

You can learn more by looking at these following links:

  1. MapR XD Distributed File and Object Store
  2. MapR Database
  3. MapR Event Store for Apache Kafka

In this tutorial you will explore each way to store and access data on a single-node MapR cluster.

Before you begin you need to tell the system who you are. Run the following command to login:

maprlogin password -user mapr

This is a toy cluster that won't last long, so you can use the password mapr. In a real cluster you would authenticate using Kerberos granted credentials or using a realistically complex password.

Verify that you've authenticated yourself to the cluster by running the following maprcli command: maprcli node list -columns ip.

MapR can be administered using the maprcli command (like we just did) or with a web interface known as the MapR Control System (MCS). Click here to open MCS, and login with these credentials:

Username: mapr

Password: mapr

When you're finished exploring MCS, return to the Terminal tab and click the Continue button.