In this learning unit, you will:
- Understand DSBulk use cases
- Use DSBulk commands
load
,unload
andcount
- Learn about DSBulk options
-url
,-k
,-t
,-m
and more - Explore several examples of using DSBulk
This scenario is also available on our datastax.com/dev site, where you can find many more resources to help you succeed with Apache Cassandra™.
In this scenario, you learned about:
- DSBulk use cases
- DSBulk commands
load
,unload
andcount
- DSBulk options
-url
,-k
,-t
,-m
and more - Several examples of using DSBulk

Steps
Bulk Loading Large Datasets into Apache Cassandra™
DataStax Bulk Loader
DataStax Bulk Loader (DSBulk) is an efficient, flexible, easy-to-use and free command-line utility for Apache Cassandra™ that excels at loading, unloading and counting data. You should use DSBulk to:
- Load data from CSV or JSON files into the database
- Unload data stored in the database into CSV or JSON files
- Quickly count the number of rows in a given table
DSBulk is a good choice for small, medium and large datasets. It gets data in and out of the database
significantly faster than individual INSERT
s, the COPY
command or other community tools. Only for very large datasets
that reside in a distributed file system, a potentially faster alternative to DSBulk
could be data loading with Apache Spark™.