The Big Data Guide
FAQs
What is big data?
Big data refers to incredibly large amounts of data (including both structured and unstructured data) that exceeds what traditional data processing software had historically been able to manage through the 1990s and early 2000s (e.g., terabytes vs zettabytes).
What are the "three Vs" of big data?
- **Volume**: This refers to the amount of data being collected.
- **Velocity**: This refers to the speed at which the data is ingested into the system, as well as the speed at which the data changes/updates over time.
- **Variety**: This refers to the various formats the data takes, as well as the different data sources ingested.
What are the different types of big data?
- **Structured data**: Structured data is usually composed of alphanumeric characters translated into a predefined format before being fed into a predetermined data model.
- **Semi-structured data**: Semi-structured data has some level of structure and organization but does not conform to the rigid data schema needed to align with traditional structured data requirements.
- **Unstructured data**: Unstructured data doesn't have consistent structure or schema and can be found in a variety of formats. These formats can include everything from videos, images, and audio files to documents, web logs, sensor data, and binary data.
What are the architecture layers in big data?
- Data sources
- Data collection and ingestion
- Data storage
- Data processing
- Data analytics
- Data visualization
- Data security and monitoring
What are the benefits of big data?
- Parallel computing
- Elastic scalability
- Freedom of choice
- Interoperability
What are the challenges with big data?
- Security
- Complexity
- Evolving technologies
- Specialized skill sets
Get started with Atlas today
Get started in seconds. Our free clusters come with 512 MB of storage so you can play around with sample data and get oriented with our platform.
GET STARTED WITH:
- 125+ regions worldwide
- Sample data sets
- Always-on authentication
- End-to-end encryption
- Command line tools