登录查看更多内容

Aggregation framework and Map Reduce in Mongodb

Anurag Vashishth

Backend Developer | DevOps and Cloud

发布日期: 2021年9月28日

+ 关注

Task Description :- Use Aggression Framework of MongoDB and Create Mapper and Reducer Program.

What is NoSQL ?

NoSQL databases (aka "not only SQL") are non tabular, and store data differently than relational tables. NoSQL databases come in a variety of types based on their data model. The main types are document, key-value, wide-column, and graph. They provide flexible schemas and scale easily with large amounts of data and high user loads.

?What is MongoDB ?

MongoDB?stores data in flexible, JSON-like documents, meaning fields can vary from document to document and data structure can be changed over time
The document model?maps to the objects in your application code, making data easy to work with

?Mongodb is a source-available cross-platform document-oriented database program. Classified as a NoSQL database program, MongoDB uses JSON-like documents with optional schemas.

Ad hoc queries, indexing, and real time aggregation?provide powerful ways to access and analyze your data
MongoDB is a?distributed database at its core, so high availability, horizontal scaling, and geographic distribution are built in and easy to use

What is MongoDB Aggregation Framework ?

Aggregation operations process data records and return computed results. Aggregation operations group values from multiple documents together, and can perform a variety of operations on the grouped data to return a single result. MongoDB provides three ways to perform aggregation: the aggregation pipeline, the map-reduce function, and single purpose aggregation methods.

What is Aggregation Pipeline ?

MongoDB’s aggregation framework is modeled on the concept of data processing pipelines. Documents enter a multi-stage pipeline that transforms the documents into an aggregated result.

What is Map Reduce Function ?

领英推荐

MongoDB—Unleashing the Potential of NoSQL for…

Shakil Khan 6 个月前

SQL vs. NoSQL: Explained

Extern Labs Inc. 1 年前

DBMS Series Part:-2 Sql, NoSql, RDBMS

Naveen chandrawanshi 1 年前

MapReduce is?a software framework and programming model used for processing huge amounts of data. MapReduce program work in two phases, namely, Map and Reduce. Map tasks deal with splitting and mapping of data while Reduce tasks shuffle and reduce the data.

Map-reduce is a data processing paradigm for condensing large volumes of data into useful?aggregated?results.

We will perform this using two MongoDB Aggregation Framework :

Aggregation Pipeline
Map-Reduce Function

Method 1: Aggregation Pipeline

db.countries.aggregate([{$group: {_id: {Language: “$Language”},
totalCountry: {$sum: 1}}}, {$sort: {totalCountry: 1}}])

# {$group: {_id: {Language: "$Language"} -->  group by Language

# totalCountry: {$sum: 1} --> count the total countries asscoiated
with that language

# {$sort: {totalCountry: 1} --> sort them in ascending order

Method 2: Map Reduce Function

var mapFunction = function() { … };
var reduceFunction = function(key, values) { … };
db.runCommand(
 {
 mapReduce: <input-collection>,
 map: mapFunction,
 reduce: reduceFunction,
 out: { merge: <output-collection> },
 query: <query>
 }
 )

Declaring Map variable :

var mapFunc1 = function()  {
  var cntry = emit(this.Language, this.CountryName);  
  $split: [ cntry, "," ];
};
# defined country variable which will be grouping the data based on Language and Country Name and then splitting the data by comma

Declaring Reduce variable :

var ReduceFunc1 = function(keyLang, valuesCountryName) { 
 return valuesCountryName.length;
};
# after grouping, here we are counting the number of countries after the output is been sent by mapper

Using Map Reduce Function :

db.countries.mapReduce(
   mapFunc1,
   ReduceFunc1, 
   {out: "map_reduced"} 
)
# now using map reduce function and saving it in map_reduced collection

!!!!!!!!!!!!!!!!!!!!!!!!!1Thanks for Reading !!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!

要查看或添加评论，请登录

Anurag Vashishth的更多文章

Fortify Your Docker Local Private Registry with SSL Authentication: A Path to Enhanced Security

2023年6月23日

Fortify Your Docker Local Private Registry with SSL Authentication: A Path to Enhanced Security

Hey Everyone, in our last article we setup the docker private registry, Now it's time to see how we can make the…

1 条评论
Docker Private Registry Setup

2023年6月4日

Docker Private Registry Setup

What is Registry? The Registry is a stateless, highly scalable server side application that stores and lets you…

6 条评论
create helm chart

2022年1月29日

create helm chart

Introduction to Kubernetes Helm Charts What is Helm? In simple terms, Helm is a package manager for Kubernetes. Helm is…
Creating Multi-Cloud setup of k8s cluster

2022年1月24日

Creating Multi-Cloud setup of k8s cluster

Nowadays, most applications are using Kubernetes for their deployments. Kubernetes cluster is generally deployed on the…
Auto-Detect Vehicle’s Number Plate Using Python

2022年1月20日

Auto-Detect Vehicle’s Number Plate Using Python

Task Description: → ?? In this task : ??Create a model that will detect a car in a live stream or video and recognize…
Create a Live Streaming Video Chat App without voice using cv2 module of Python.

2022年1月19日

Create a Live Streaming Video Chat App without voice using cv2 module of Python.

what is OpenCV OpenCV-Python is a library of Python bindings designed to solve computer vision problems. .
create unique terraform module

2022年1月19日

create unique terraform module

Task Description ?? Create unique terraform modules and upload on public terraform registry What is Terraform?…
create a web menu using python-CGI And API integration

2022年1月19日

create a web menu using python-CGI And API integration

Task 9.2 Create a Web Menu Using Python-CGI and API integrating all the concepts that have been taught by Vimal sir…
integrating some of the important task

2022年1月19日

integrating some of the important task

In this article we will be integrating some of the very important task so Let' s begin Task Description #1- AWS *1*…
AWS SQS and Its Use-cases

2021年9月28日

AWS SQS and Its Use-cases

Task description : Create an article on case study of AWS SQS. When we start deploying multiple applications, they will…

See all articles

Aggregation framework and Map Reduce in Mongodb

Anurag Vashishth

Backend Developer | DevOps and Cloud

Task Description :- Use Aggression Framework of MongoDB and Create Mapper and Reducer Program.

What is NoSQL ?

?What is MongoDB ?

What is MongoDB Aggregation Framework ?

What is Aggregation Pipeline ?

What is Map Reduce Function ?

领英推荐

We will perform this using two MongoDB Aggregation Framework :

Method 1: Aggregation Pipeline

Method 2: Map Reduce Function

Declaring Map variable :

Using Map Reduce Function :

!!!!!!!!!!!!!!!!!!!!!!!!!1Thanks for Reading !!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!

Anurag Vashishth的更多文章

社区洞察

其他会员也浏览了

MongoDB: A NoSQL Database

SQL vs NoSQL: Picking the Right Side in the Database Showdown

ElasticSearch

Write Path in Key-Value Stores: System Design with Apache Cassandra

SQL vs. NoSQL Databases: Key Differences Explained

Mastering MongoDB CRUD Operations: A Step-by-Step Guide

Introduction to Apache Cassandra: A Distributed NoSQL Database for Big Data

MongoDB - The Best NoSQL Database for Modern Applications

Leveraging Data Science with MongoDB: Unleashing the Potential of NoSQL Technology

The Significance of MongoDB in Contemporary Data Management

Task Description :- Use Aggression Framework of MongoDB and Create Mapper and Reducer Program.

What is NoSQL ?

?What is MongoDB ?

What is MongoDB Aggregation Framework ?

What is Aggregation Pipeline ?

What is Map Reduce Function ?

领英推荐

We will perform this using two MongoDB Aggregation Framework :

Method 1: Aggregation Pipeline

Method 2: Map Reduce Function

Declaring Map variable :

Using Map Reduce Function :

!!!!!!!!!!!!!!!!!!!!!!!!!1Thanks for Reading !!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!

Anurag Vashishth的更多文章

Fortify Your Docker Local Private Registry with SSL Authentication: A Path to Enhanced Security

Docker Private Registry Setup

create helm chart

Creating Multi-Cloud setup of k8s cluster

Auto-Detect Vehicle’s Number Plate Using Python

Create a Live Streaming Video Chat App without voice using cv2 module of Python.

create unique terraform module

create a web menu using python-CGI And API integration

integrating some of the important task

AWS SQS and Its Use-cases

社区洞察

其他会员也浏览了

MongoDB: A NoSQL Database

SQL vs NoSQL: Picking the Right Side in the Database Showdown

ElasticSearch

Write Path in Key-Value Stores: System Design with Apache Cassandra

SQL vs. NoSQL Databases: Key Differences Explained

Mastering MongoDB CRUD Operations: A Step-by-Step Guide

Introduction to Apache Cassandra: A Distributed NoSQL Database for Big Data

MongoDB - The Best NoSQL Database for Modern Applications

Leveraging Data Science with MongoDB: Unleashing the Potential of NoSQL Technology

The Significance of MongoDB in Contemporary Data Management