This is a guide to installing MongoDB Compass and connecting to the Nellodee remote server. As well as a brief overview of some simple mongo queries using the Compass GUI.
Navigate to Install MongoDB Compass
Choose the version corresponding to your operating system.
You shouldn't need to install MongoDB itself to connect to a remote instance of MongoDB but I will leave the link to install MongoDB on your local system here: Install MongoDB
If you want to run/test/use MongoDB on your local system, you will need to follow the steps on the link above and make sure you choose the Community Edition corresponding to your operating system.
-
First check to see if the mongod service is already running by using the following command in the terminal
service mongod status
-
If the
Activeproperty says "active (running)" then you can skip to the next step. -
If the
Activeproperty says "inactive (dead)" then you can run the following commandsudo systemctl start mongod
At this point you should have a working version of the MongoDB Compass GUI installed and opened and confirmed that the mongo service has started on nellodee.
Watch the video to see me walk through the steps to connect to the mongo service on nellodee.
mongo_compass_nellodee.mp4
- Click
New Connectionbutton - Click
Advanced Connection Optionsexpansion button - Choose
Proxy/SSHtab - Fill in all four fields:
- SSH Hostname:
nellodee.si.umich.edu - SSH Port:
22 - SSH Username:
your umich uniqname - SSH Password:
password associated with your uniqname
- SSH Hostname:
- Click
Save & Connect - You will be prompted to choose a color and enter a name for your connection
At this point you should be connected to the running mongo service on nellodee and see a list of the databases stored in mongo on nellodee. If you are already familiar with MongoDB queries then you should be good to go.
If not, I will link a cheat sheet to standard mongo queries to get you started.
Each method below can be opened up for a more detailed explanation and examples
Filters the documents to pass only the documents that match the specified condition(s) to the next pipeline stage.
The $match stage has the following prototype form:
{ $match: { <query> } }
$match takes a document that specifies the query conditions. The query syntax is identical to the read operation query syntax; i.e. $match does not accept raw aggregation expressions. Instead, use a $expr query expression to include aggregation expression in $match.
The examples use a collection named articles with the following documents:
{ "_id" : ObjectId("512bc95fe835e68f199c8686"), "author" : "dave", "score" : 80, "views" : 100 }
{ "_id" : ObjectId("512bc962e835e68f199c8687"), "author" : "dave", "score" : 85, "views" : 521 }
{ "_id" : ObjectId("55f5a192d4bede9ac365b257"), "author" : "ahn", "score" : 60, "views" : 1000 }
{ "_id" : ObjectId("55f5a192d4bede9ac365b258"), "author" : "li", "score" : 55, "views" : 5000 }
{ "_id" : ObjectId("55f5a1d3d4bede9ac365b259"), "author" : "annT", "score" : 60, "views" : 50 }
{ "_id" : ObjectId("55f5a1d3d4bede9ac365b25a"), "author" : "li", "score" : 94, "views" : 999 }
{ "_id" : ObjectId("55f5a1d3d4bede9ac365b25b"), "author" : "ty", "score" : 95, "views" : 1000 }
The following operation uses $match to perform a simple equality match:
db.articles.aggregate(
[ { $match : { author : "dave" } } ]
);
The $match selects the documents where the author field equals dave, and the aggregation returns the following:
{ "_id" : ObjectId("512bc95fe835e68f199c8686"), "author" : "dave", "score" : 80, "views" : 100 }
{ "_id" : ObjectId("512bc962e835e68f199c8687"), "author" : "dave", "score" : 85, "views" : 521 }
The `$group` stage separates documents into groups according to a "group key". The output is one document for each unique group key.
A group key is often a field, or group of fields. The group key can also be the result of an expression. Use the _id field in the $group pipeline stage to set the group key. See below for usage examples.
In the $group stage output, the _id field is set to the group key for that document.
The output documents can also contain additional fields that are set using accumulator expressions.
$group does not order its output documents.
The $group stage has the following prototype form:
{
$group:
{
_id: <expression>, // Group key
<field1>: { <accumulator1> : <expression1> },
...
}
}
The <accumulator> operator must be one of the following accumulator operators:
list of valid accumulators
In mongosh create a sample collection named sales with the following documents:
db.sales.insertMany([
{ "_id" : 1, "item" : "abc", "price" : NumberDecimal("10"), "quantity" : NumberInt("2"), "date" : ISODate("2014-03-01T08:00:00Z") },
{ "_id" : 2, "item" : "jkl", "price" : NumberDecimal("20"), "quantity" : NumberInt("1"), "date" : ISODate("2014-03-01T09:00:00Z") },
{ "_id" : 3, "item" : "xyz", "price" : NumberDecimal("5"), "quantity" : NumberInt( "10"), "date" : ISODate("2014-03-15T09:00:00Z") },
{ "_id" : 4, "item" : "xyz", "price" : NumberDecimal("5"), "quantity" : NumberInt("20") , "date" : ISODate("2014-04-04T11:21:39.736Z") },
{ "_id" : 5, "item" : "abc", "price" : NumberDecimal("10"), "quantity" : NumberInt("10") , "date" : ISODate("2014-04-04T21:23:13.331Z") },
{ "_id" : 6, "item" : "def", "price" : NumberDecimal("7.5"), "quantity": NumberInt("5" ) , "date" : ISODate("2015-06-04T05:08:13Z") },
{ "_id" : 7, "item" : "def", "price" : NumberDecimal("7.5"), "quantity": NumberInt("10") , "date" : ISODate("2015-09-10T08:43:00Z") },
{ "_id" : 8, "item" : "abc", "price" : NumberDecimal("10"), "quantity" : NumberInt("5" ) , "date" : ISODate("2016-02-06T20:20:13Z") },
])
The following aggregation operation uses the $group stage to count the number of documents in the sales collection:
db.sales.aggregate( [ { $group: { _id: null, count: { $count: { } } } }] )
The operation returns the following result:
{ "_id" : null, "count" : 8 }
This aggregation operation is equivalent to the following SQL statement:
SELECT COUNT(*) AS count FROM sales
Passes along the documents with the requested fields to the next stage in the pipeline. The specified fields can be existing fields from the input documents or newly computed fields.
The $project stage has the following prototype form:
{ $project: { <specification(s)> } }
The $project takes a document that can specify the inclusion of fields, the suppression of the _id field, the addition of new fields, and the resetting of the values of existing fields. Alternatively, you may specify the exclusion of fields.
The $project specifications have the following forms:
| Form | Description |
|---|---|
<field>: <1 or true> |
Specifies the inclusion of a field. Non-zero integers are also treated as true. |
_id: <0 or false> |
Specifies the suppression of the _id field.To exclude a field conditionally, use the REMOVE variable instead. For details, see Exclude Fields Conditionally. |
<field>: <expression> |
Adds a new field or resets the value of an existing field. If the expression evaluates to $$REMOVE, the field is excluded in the output. For details, see Exclude Fields Conditionally. |
<field>:<0 or false> |
Specifies the exclusion of a field. To exclude a field conditionally, use the REMOVE variable instead. For details, see Exclude Fields Conditionally.If you specify the exclusion of a field other than _id, you cannot employ any other $project specification forms. This restriction does not apply to conditionally exclusion of a field using the REMOVE variable.See also the $unset stage to exclude fields. |
-
The
_idfield is, by default, included in the output documents. To include any other fields from the input documents in the output documents, you must explicitly specify the inclusion in$project. -
If you specify an inclusion of a field that does not exist in the document,
$projectignores that field inclusion and does not add the field to the document.
By default, the _id field is included in the output documents. To exclude the _id field from the output documents, you must explicitly specify the suppression of the _id field in $project.
If you specify the exclusion of a field or fields, all other fields are returned in the output documents.
{ $project: { "<field1>": 0, "<field2>": 0, ... } } // Return all but the specified fields
If you specify the exclusion of a field other than _id, you cannot employ any other $project specification forms: i.e. if you exclude fields, you cannot also specify the inclusion of fields, reset the value of existing fields, or add new fields. This restriction does not apply to conditional exclusion of a field using the REMOVE variable.
See also the $unset stage to exclude fields.
You can use the variable REMOVE in aggregation expressions to conditionally suppress a field. For an example, see Conditionally Exclude Fields.
Consider a books collection with the following document:
{ "_id" : 1, title: "abc123", isbn: "0001122223334", author: { last: "zzz", first: "aaa" }, copies: 5}
The following $project stage includes only the _id, title, and the author fields in its output documents:
db.books.aggregate( [ { $project : { title : 1 , author : 1 } } ] )
The operation results in the following document:
{ "_id" : 1, "title" : "abc123", "author" : { "last" : "zzz", "first" : "aaa" } }
The _id field is always included by default. To exclude the _id field from the output documents of the $project stage, specify the exclusion of the _id field by setting it to 0 in the projection document.
Consider a books collection with the following document:
{ "_id" : 1, title: "abc123", isbn: "0001122223334", author: { last: "zzz", first: "aaa" }, copies: 5}
The following $project stage excludes the _id field but includes the title, and the author fields in its output documents:
db.books.aggregate( [ { $project : { _id: 0, title : 1 , author : 1 } } ] )
The operation results in the following document:
{ "title" : "abc123", "author" : { "last" : "zzz", "first" : "aaa" } }
Consider a books collection with the following document:
{ "_id" : 1, title: "abc123", isbn: "0001122223334", author: { last: "zzz", first: "aaa" }, copies: 5, lastModified: "2016-07-28"}
The following $project stage excludes the lastModified field from the output:
db.books.aggregate( [ { $project : { "lastModified": 0 } } ] )
See also the $unset stage to exclude fields.
Consider a books collection with the following document:
{ "_id" : 1, title: "abc123", isbn: "0001122223334", author: { last: "zzz", first: "aaa" }, copies: 5, lastModified: "2016-07-28"}
The following $project stage excludes the author.first and lastModified fields from the output:
db.books.aggregate( [ { $project : { "author.first" : 0, "lastModified" : 0 } } ] )
Alternatively, you can nest the exclusion specification in a document:
db.bookmarks.aggregate( [ { $project: { "author": { "first": 0}, "lastModified" : 0 } } ] )
Both specifications result in the same output:
{ "_id" : 1, "title" : "abc123", "isbn" : "0001122223334", "author" : { "last" : "zzz" }, "copies" : 5,}
See also the $unset stage to exclude fields.
You can use the variable REMOVE in aggregation expressions to conditionally suppress a field.
Consider a books collection with the following document:
{ "_id" : 1, title: "abc123", isbn: "0001122223334", author: { last: "zzz", first: "aaa" }, copies: 5, lastModified: "2016-07-28"}{ "_id" : 2, title: "Baked Goods", isbn: "9999999999999", author: { last: "xyz", first: "abc", middle: "" }, copies: 2, lastModified: "2017-07-21"}{ "_id" : 3, title: "Ice Cream Cakes", isbn: "8888888888888", author: { last: "xyz", first: "abc", middle: "mmm" }, copies: 5, lastModified: "2017-07-22"}
The following $project stage uses the REMOVE variable to excludes the author.middle field only if it equals "":
db.books.aggregate( [ { $project: { title: 1, "author.first": 1, "author.last" : 1, "author.middle": { $cond: { if: { $eq: [ "", "$author.middle" ] }, then: "$$REMOVE", else: "$author.middle" } } } }] )
The aggregation operation results in the following output:
{ "_id" : 1, "title" : "abc123", "author" : { "last" : "zzz", "first" : "aaa" } }
{ "_id" : 2, "title" : "Baked Goods", "author" : { "last" : "xyz", "first" : "abc" } }{ "_id" : 3, "title" : "Ice Cream Cakes", "author" : { "last" : "xyz", "first" : "abc", "middle" : "mmm" } }
Adds new fields to documents. `$set` outputs documents that contain all existing fields from the input documents and newly added fields.
New in version 4.2.
The $set stage is an alias for $addFields.
Both stages are equivalent to a $project stage that explicitly specifies all existing fields in the input documents and adds the new fields.
$set has the following form:
{ $set: { <newField>: <expression>, ... } }
Specify the name of each field to add and set its value to an aggregation expression. For more information on expressions, see Expressions.
If the name of the new field is the same as an existing field name (including _id), $set overwrites the existing value of that field with the value of the specified expression.
$set appends new fields to existing documents. You can include one or more $set stages in an aggregation operation.
To add field or fields to embedded documents (including documents in arrays) use the dot notation. See example.
To add an element to an existing array field with $set, use with $concatArrays. See example.
Create a sample scores collection with the following:
db.scores.insertMany([ { _id: 1, student: "Maya", homework: [ 10, 5, 10 ], quiz: [ 10, 8 ], extraCredit: 0 }, { _id: 2, student: "Ryan", homework: [ 5, 6, 5 ], quiz: [ 8, 8 ], extraCredit: 8 }])
The following operation uses two $set stages to include three new fields in the output documents:
db.scores.aggregate( [ { $set: { totalHomework: { $sum: "$homework" }, totalQuiz: { $sum: "$quiz" } } }, { $set: { totalScore: { $add: [ "$totalHomework", "$totalQuiz", "$extraCredit" ] } } }] )
The operation returns the following documents:
{ "_id" : 1, "student" : "Maya", "homework" : [ 10, 5, 10 ], "quiz" : [ 10, 8 ], "extraCredit" : 0, "totalHomework" : 25, "totalQuiz" : 18, "totalScore" : 43}{ "_id" : 2, "student" : "Ryan", "homework" : [ 5, 6, 5 ], "quiz" : [ 8, 8 ], "extraCredit" : 8, "totalHomework" : 16, "totalQuiz" : 16, "totalScore" : 40}
Use dot notation to add new fields to embedded documents.
Create a sample collection vehicles with the following:
db.vehicles.insertMany([ { _id: 1, type: "car", specs: { doors: 4, wheels: 4 } }, { _id: 2, type: "motorcycle", specs: { doors: 0, wheels: 2 } }, { _id: 3, type: "jet ski" }])
The following aggregation operation adds a new field fuel_type to the embedded document specs.
db.vehicles.aggregate( [ { $set: { "specs.fuel_type": "unleaded" } }] )
The operation returns the following results:
{ _id: 1, type: "car", specs: { doors: 4, wheels: 4, fuel_type: "unleaded" } }{ _id: 2, type: "motorcycle", specs: { doors: 0, wheels: 2, fuel_type: "unleaded" } }{ _id: 3, type: "jet ski", specs: { fuel_type: "unleaded" } }
Specifying an existing field name in a $set operation causes the original field to be replaced.
Create a sample collection called animals with the following:
db.animals.insertOne( { _id: 1, dogs: 10, cats: 15 } )
The following $set operation overrides the cats field:
db.animals.aggregate( [ { $set: { "cats": 20 } }] )
The operation returns the following document:
{ _id: 1, dogs: 10, cats: 20 }
It is possible to replace one field with another. In the following example the item field substitutes for the _id field.
Create a sample collection called fruits contains the following documents:
db.fruits.insertMany([ { "_id" : 1, "item" : "tangerine", "type" : "citrus" }, { "_id" : 2, "item" : "lemon", "type" : "citrus" }, { "_id" : 3, "item" : "grapefruit", "type" : "citrus" }])
The following aggregration operation uses $set to replace the _id field of each document with the value of the item field, and replaces the item field with a string "fruit".
db.fruits.aggregate( [ { $set: { _id : "$item", item: "fruit" } }] )
The operation returns the following:
{ "_id" : "tangerine", "item" : "fruit", "type" : "citrus" }{ "_id" : "lemon", "item" : "fruit", "type" : "citrus" }{ "_id" : "grapefruit", "item" : "fruit", "type" : "citrus" }
Create a sample scores collection with the following:
db.scores.insertMany([ { _id: 1, student: "Maya", homework: [ 10, 5, 10 ], quiz: [ 10, 8 ], extraCredit: 0 }, { _id: 2, student: "Ryan", homework: [ 5, 6, 5 ], quiz: [ 8, 8 ], extraCredit: 8 }])
You can use $set with a $concatArrays expression to add an element to an existing array field. For example, the following operation uses $set to replace the homework field with a new array whose elements are the current homework array concatenated with another array containing a new score [ 7 ].
db.scores.aggregate([ { $match: { _id: 1 } }, { $set: { homework: { $concatArrays: [ "$homework", [ 7 ] ] } } }])
The operation returns the following:
{ "_id" : 1, "student" : "Maya", "homework" : [ 10, 5, 10, 7 ], "quiz" : [ 10, 8 ], "extraCredit" : 0 }
Create a sample scores collection with the following:
db.scores.insertMany([ { _id: 1, student: "Maya", homework: [ 10, 5, 10 ], quiz: [ 10, 8 ], extraCredit: 0 }, { _id: 2, student: "Ryan", homework: [ 5, 6, 5 ], quiz: [ 8, 8 ], extraCredit: 8 }])
The following aggregation operation adds a new field quizAverage to each document that contains the average of the quiz array.
db.scores.aggregate( [ { $set: { quizAverage: { $avg: "$quiz" } } }] )
The operation returns the following documents:
[ { _id: 1, student: 'Maya', homework: [ 10, 5, 10 ], quiz: [ 10, 8 ], extraCredit: 0, quizAverage: 9 }, { _id: 2, student: 'Ryan', homework: [ 5, 6, 5 ], quiz: [ 8, 8 ], extraCredit: 8, quizAverage: 8 }]
Sorts all input documents and returns them to the pipeline in sorted order.
The $sort stage has the following prototype form:
{ $sort: { <field1>: <sort order>, <field2>: <sort order> ... } }
$sort takes a document that specifies the field(s) to sort by and the respective sort order. <sort order> can have one of the following values:
| Value | Description |
|---|---|
1 |
Sort ascending. |
-1 |
Sort descending. |
{ $meta: "textScore" } |
Sort by the computed textScore metadata in descending order. See Text Score Metadata Sort for an example. |
If sorting on multiple fields, sort order is evaluated from left to right. For example, in the form above, documents are first sorted by <field1>. Then documents with the same <field1> values are further sorted by <field2>.
You can sort on a maximum of 32 keys.
MongoDB does not store documents in a collection in a particular order. When sorting on a field which contains duplicate values, documents containing those values may be returned in any order.
If consistent sort order is desired, include at least one field in your sort that contains unique values. The easiest way to guarantee this is to include the _id field in your sort query.
Consider the following restaurant collection:
db.restaurants.insertMany( [ { "_id" : 1, "name" : "Central Park Cafe", "borough" : "Manhattan"}, { "_id" : 2, "name" : "Rock A Feller Bar and Grill", "borough" : "Queens"}, { "_id" : 3, "name" : "Empire State Pub", "borough" : "Brooklyn"}, { "_id" : 4, "name" : "Stan's Pizzaria", "borough" : "Manhattan"}, { "_id" : 5, "name" : "Jane's Deli", "borough" : "Brooklyn"},] )
The following command uses the $sort stage to sort on the borough field:
db.restaurants.aggregate( [ { $sort : { borough : 1 } } ])
In this example, sort order may be inconsistent, since the borough field contains duplicate values for both Manhattan and Brooklyn. Documents are returned in alphabetical order by borough, but the order of those documents with duplicate values for borough might not the be the same across multiple executions of the same sort. For example, here are the results from two different executions of the above command:
{ "_id" : 3, "name" : "Empire State Pub", "borough" : "Brooklyn" }{ "_id" : 5, "name" : "Jane's Deli", "borough" : "Brooklyn" }{ "_id" : 1, "name" : "Central Park Cafe", "borough" : "Manhattan" }{ "_id" : 4, "name" : "Stan's Pizzaria", "borough" : "Manhattan" }{ "_id" : 2, "name" : "Rock A Feller Bar and Grill", "borough" : "Queens" }{ "_id" : 5, "name" : "Jane's Deli", "borough" : "Brooklyn" }{ "_id" : 3, "name" : "Empire State Pub", "borough" : "Brooklyn" }{ "_id" : 4, "name" : "Stan's Pizzaria", "borough" : "Manhattan" }{ "_id" : 1, "name" : "Central Park Cafe", "borough" : "Manhattan" }{ "_id" : 2, "name" : "Rock A Feller Bar and Grill", "borough" : "Queens" }
While the values for borough are still sorted in alphabetical order, the order of the documents containing duplicate values for borough (i.e. Manhattan and Brooklyn) is not the same.
To achieve a consistent sort, add a field which contains exclusively unique values to the sort. The following command uses the $sort stage to sort on both the borough field and the _id field:
db.restaurants.aggregate( [ { $sort : { borough : 1, _id: 1 } } ])
Since the _id field is always guaranteed to contain exclusively unique values, the returned sort order will always be the same across multiple executions of the same sort.
For the field or fields to sort by, set the sort order to 1 or -1 to specify an ascending or descending sort respectively, as in the following example:
db.users.aggregate( [ { $sort : { age : -1, posts: 1 } } ])
This operation sorts the documents in the users collection, in descending order according by the age field and then in ascending order according to the value in the posts field.
When comparing values of different BSON types in sort operations, MongoDB uses the following comparison order, from lowest to highest:
-
MinKey (internal type)
-
Null
-
Numbers (ints, longs, doubles, decimals)
-
Symbol, String
-
Object
-
Array
-
BinData
-
ObjectId
-
Boolean
-
Date
-
Timestamp
-
Regular Expression
-
MaxKey (internal type)
For details on the comparison/sort order for specific types, see Comparison/Sort Order.
For a pipeline that includes a $text search, you can sort by descending relevance score using the { $meta: "textScore" } expression. In the { <sort-key> } document, set the { $meta: "textScore" } expression to an arbitrary field name. The field name is ignored by the query system. For example:
db.users.aggregate( [ { $match: { $text: { $search: "operating" } } }, { $sort: { score: { $meta: "textScore" }, posts: -1 } } ])
This operation uses the $text operator to match the documents, and then sorts first by the "textScore" metadata in descending order, and then by the posts field in descending order. The score field name in the sort document is ignored by the query system. In this pipeline, the "textScore" metadata is not included in the projection and is not returned as part of the matching documents. See $meta for more information.
Deconstructs an array field from the input documents to output a document for _each_ element. Each output document is the input document with the value of the array field replaced by the element.
_each_ element. Each output document is the input document with the value of the array field replaced by the element.You can pass a field path operand or a document operand to unwind an array field.
You can pass the array field path to $unwind. When using this syntax, $unwind does not output a document if the field value is null, missing, or an empty array.
{ $unwind: <field path> }
When you specify the field path, prefix the field name with a dollar sign $ and enclose in quotes.
You can pass a document to $unwind to specify various behavior options.
{ $unwind: { path: <field path>, includeArrayIndex: <string>, preserveNullAndEmptyArrays: <boolean> }}
| Field | Type | Description |
|---|---|---|
| path | string | Field path to an array field. To specify a field path, prefix the field name with a dollar sign $ and enclose in quotes. |
| includeArrayIndex | string | Optional. The name of a new field to hold the array index of the element. The name cannot start with a dollar sign $. |
| preserveNullAndEmptyArrays | boolean | Optional. - If true, if the path is null, missing, or an empty array, $unwind outputs the document.- If false, if path is null, missing, or an empty array, $unwind does not output a document.The default value is false. |
-
When the operand does not resolve to an array, but is not missing,
null, or an empty array,$unwindtreats the operand as a single element array. -
When the operand is
null, missing, or an empty array$unwindfollows the behavior set for the preserveNullAndEmptyArrays option.
If you specify a path for a field that does not exist in an input document or the field is an empty array, $unwind, by default, ignores the input document and will not output documents for that input document.
To output documents where the array field is missing, null or an empty array, use the preserveNullAndEmptyArrays option.
In mongosh, create a sample collection named inventory with the following document:
db.inventory.insertOne({ "_id" : 1, "item" : "ABC1", sizes: [ "S", "M", "L"] })
The following aggregation uses the $unwind stage to output a document for each element in the sizes array:
db.inventory.aggregate( [ { $unwind : "$sizes" } ] )
The operation returns the following results:
{ "_id" : 1, "item" : "ABC1", "sizes" : "S" }{ "_id" : 1, "item" : "ABC1", "sizes" : "M" }{ "_id" : 1, "item" : "ABC1", "sizes" : "L" }
Each document is identical to the input document except for the value of the sizes field which now holds a value from the original sizes array.
Consider the clothing collection:
db.clothing.insertMany([ { "_id" : 1, "item" : "Shirt", "sizes": [ "S", "M", "L"] }, { "_id" : 2, "item" : "Shorts", "sizes" : [ ] }, { "_id" : 3, "item" : "Hat", "sizes": "M" }, { "_id" : 4, "item" : "Gloves" }, { "_id" : 5, "item" : "Scarf", "sizes" : null }])
$unwind treats the sizes field as a single element array if:
-
the field is present,
-
the value is not null, and
-
the value is not an empty array.
Expand the sizes arrays with $unwind:
db.clothing.aggregate( [ { $unwind: { path: "$sizes" } } ] )
The $unwind operation returns:
{ _id: 1, item: 'Shirt', sizes: 'S' },{ _id: 1, item: 'Shirt', sizes: 'M' },{ _id: 1, item: 'Shirt', sizes: 'L' },{ _id: 3, item: 'Hat', sizes: 'M' }
-
In document
"_id": 1,sizesis a populated array.$unwindreturns a document for each element in thesizesfield. -
In document
"_id": 3,sizesresolves to a single element array. -
Documents
"_id": 2, "_id": 4, and"_id": 5do not return anything because thesizesfield cannot be reduced to a single element array.
The { path: <FIELD> } syntax is optional. The following $unwind operations are equivalent.
db.clothing.aggregate( [ { $unwind: "$sizes" } ] )db.clothing.aggregate( [ { $unwind: { path: "$sizes" } } ] )
The preserveNullAndEmptyArrays and includeArrayIndex examples use the following collection:
db.inventory2.insertMany([ { "_id" : 1, "item" : "ABC", price: NumberDecimal("80"), "sizes": [ "S", "M", "L"] }, { "_id" : 2, "item" : "EFG", price: NumberDecimal("120"), "sizes" : [ ] }, { "_id" : 3, "item" : "IJK", price: NumberDecimal("160"), "sizes": "M" }, { "_id" : 4, "item" : "LMN" , price: NumberDecimal("10") }, { "_id" : 5, "item" : "XYZ", price: NumberDecimal("5.75"), "sizes" : null }])
The following $unwind operation uses the preserveNullAndEmptyArrays option to include documents whose sizes field is null, missing, or an empty array.
db.inventory2.aggregate( [ { $unwind: { path: "$sizes", preserveNullAndEmptyArrays: true } }] )
The output includes those documents where the sizes field is null, missing, or an empty array:
{ "_id" : 1, "item" : "ABC", "price" : NumberDecimal("80"), "sizes" : "S" }{ "_id" : 1, "item" : "ABC", "price" : NumberDecimal("80"), "sizes" : "M" }{ "_id" : 1, "item" : "ABC", "price" : NumberDecimal("80"), "sizes" : "L" }{ "_id" : 2, "item" : "EFG", "price" : NumberDecimal("120") }{ "_id" : 3, "item" : "IJK", "price" : NumberDecimal("160"), "sizes" : "M" }{ "_id" : 4, "item" : "LMN", "price" : NumberDecimal("10") }{ "_id" : 5, "item" : "XYZ", "price" : NumberDecimal("5.75"), "sizes" : null }
The following $unwind operation uses the includeArrayIndex option to include the array index in the output.
db.inventory2.aggregate( [ { $unwind: { path: "$sizes", includeArrayIndex: "arrayIndex" } }])
The operation unwinds the sizes array and includes the array index in the new arrayIndex field. If the sizes field does not resolve to a populated array but is not missing, null, or an empty array, the arrayIndex field is null.
{ "_id" : 1, "item" : "ABC", "price" : NumberDecimal("80"), "sizes" : "S", "arrayIndex" : NumberLong(0) }{ "_id" : 1, "item" : "ABC", "price" : NumberDecimal("80"), "sizes" : "M", "arrayIndex" : NumberLong(1) }{ "_id" : 1, "item" : "ABC", "price" : NumberDecimal("80"), "sizes" : "L", "arrayIndex" : NumberLong(2) }{ "_id" : 3, "item" : "IJK", "price" : NumberDecimal("160"), "sizes" : "M", "arrayIndex" : null }
In mongosh, create a sample collection named inventory2 with the following documents:
db.inventory2.insertMany([ { "_id" : 1, "item" : "ABC", price: NumberDecimal("80"), "sizes": [ "S", "M", "L"] }, { "_id" : 2, "item" : "EFG", price: NumberDecimal("120"), "sizes" : [ ] }, { "_id" : 3, "item" : "IJK", price: NumberDecimal("160"), "sizes": "M" }, { "_id" : 4, "item" : "LMN" , price: NumberDecimal("10") }, { "_id" : 5, "item" : "XYZ", price: NumberDecimal("5.75"), "sizes" : null }])
The following pipeline unwinds the sizes array and groups the resulting documents by the unwound size values:
db.inventory2.aggregate( [ // First Stage { $unwind: { path: "$sizes", preserveNullAndEmptyArrays: true } }, // Second Stage { $group: { _id: "$sizes", averagePrice: { $avg: "$price" } } }, // Third Stage { $sort: { "averagePrice": -1 } }] )
First Stage:
The $unwind stage outputs a new document for each element in the sizes array. The stage uses the preserveNullAndEmptyArrays option to include in the output those documents where sizes field is missing, null or an empty array. This stage passes the following documents to the next stage:
{ "_id" : 1, "item" : "ABC", "price" : NumberDecimal("80"), "sizes" : "S" }{ "_id" : 1, "item" : "ABC", "price" : NumberDecimal("80"), "sizes" : "M" }{ "_id" : 1, "item" : "ABC", "price" : NumberDecimal("80"), "sizes" : "L" }{ "_id" : 2, "item" : "EFG", "price" : NumberDecimal("120") }{ "_id" : 3, "item" : "IJK", "price" : NumberDecimal("160"), "sizes" : "M" }{ "_id" : 4, "item" : "LMN", "price" : NumberDecimal("10") }{ "_id" : 5, "item" : "XYZ", "price" : NumberDecimal("5.75"), "sizes" : null }
Second Stage:
The $group stage groups the documents by sizes and calculates the average price of each size. This stage passes the following documents to the next stage:
{ "_id" : "S", "averagePrice" : NumberDecimal("80") }{ "_id" : "L", "averagePrice" : NumberDecimal("80") }{ "_id" : "M", "averagePrice" : NumberDecimal("120") }{ "_id" : null, "averagePrice" : NumberDecimal("45.25") }
Third Stage:
The $sort stage sorts the documents by averagePrice in descending order. The operation returns the following result:
{ "_id" : "M", "averagePrice" : NumberDecimal("120") }{ "_id" : "L", "averagePrice" : NumberDecimal("80") }{ "_id" : "S", "averagePrice" : NumberDecimal("80") }{ "_id" : null, "averagePrice" : NumberDecimal("45.25") }
In mongosh, create a sample collection named sales with the following documents:
db.sales.insertMany([ { _id: "1", "items" : [ { "name" : "pens", "tags" : ["writing", "office", "school", "stationary" ], "price" : NumberDecimal("12.00"), "quantity" : NumberInt("5") }, { "name" : "envelopes", "tags" : [ "stationary", "office" ], "price" : NumberDecimal("19.95"), "quantity" : NumberInt("8") } ] }, { _id: "2", "items" : [ { "name" : "laptop", "tags" : [ "office", "electronics" ], "price" : NumberDecimal("800.00"), "quantity" : NumberInt("1") }, { "name" : "notepad", "tags" : [ "stationary", "school" ], "price" : NumberDecimal("14.95"), "quantity" : NumberInt("3") } ] }])
The following operation groups the items sold by their tags and calculates the total sales amount per each tag.
db.sales.aggregate([ // First Stage { $unwind: "$items" }, // Second Stage { $unwind: "$items.tags" }, // Third Stage { $group: { _id: "$items.tags", totalSalesAmount: { $sum: { $multiply: [ "$items.price", "$items.quantity" ] } } } }])
First Stage
The first $unwind stage outputs a new document for each element in the items array:
{ "_id" : "1", "items" : { "name" : "pens", "tags" : [ "writing", "office", "school", "stationary" ], "price" : NumberDecimal("12.00"), "quantity" : 5 } }{ "_id" : "1", "items" : { "name" : "envelopes", "tags" : [ "stationary", "office" ], "price" : NumberDecimal("19.95"), "quantity" : 8 } }{ "_id" : "2", "items" : { "name" : "laptop", "tags" : [ "office", "electronics" ], "price" : NumberDecimal("800.00"), "quantity" : 1 } }{ "_id" : "2", "items" : { "name" : "notepad", "tags" : [ "stationary", "school" ], "price" : NumberDecimal("14.95"), "quantity" : 3 } }
Second Stage
The second $unwind stage outputs a new document for each element in the items.tags arrays:
{ "_id" : "1", "items" : { "name" : "pens", "tags" : "writing", "price" : NumberDecimal("12.00"), "quantity" : 5 } }{ "_id" : "1", "items" : { "name" : "pens", "tags" : "office", "price" : NumberDecimal("12.00"), "quantity" : 5 } }{ "_id" : "1", "items" : { "name" : "pens", "tags" : "school", "price" : NumberDecimal("12.00"), "quantity" : 5 } }{ "_id" : "1", "items" : { "name" : "pens", "tags" : "stationary", "price" : NumberDecimal("12.00"), "quantity" : 5 } }{ "_id" : "1", "items" : { "name" : "envelopes", "tags" : "stationary", "price" : NumberDecimal("19.95"), "quantity" : 8 } }{ "_id" : "1", "items" : { "name" : "envelopes", "tags" : "office", "price" : NumberDecimal("19.95"), "quantity" : 8 } }{ "_id" : "2", "items" : { "name" : "laptop", "tags" : "office", "price" : NumberDecimal("800.00"), "quantity" : 1 } }{ "_id" : "2", "items" : { "name" : "laptop", "tags" : "electronics", "price" : NumberDecimal("800.00"), "quantity" : 1 } }{ "_id" : "2", "items" : { "name" : "notepad", "tags" : "stationary", "price" : NumberDecimal("14.95"), "quantity" : 3 } }{ "_id" : "2", "items" : { "name" : "notepad", "tags" : "school", "price" : NumberDecimal("14.95"), "quantity" : 3 } }
Third Stage
The $group stage groups the documents by the tag and calculates the total sales amount of items with each tag:
{ "_id" : "writing", "totalSalesAmount" : NumberDecimal("60.00") }{ "_id" : "stationary", "totalSalesAmount" : NumberDecimal("264.45") }{ "_id" : "electronics", "totalSalesAmount" : NumberDecimal("800.00") }{ "_id" : "school", "totalSalesAmount" : NumberDecimal("104.85") }{ "_id" : "office", "totalSalesAmount" : NumberDecimal("1019.60") }
Takes the documents returned by the aggregation pipeline and writes them to a specified collection. Starting in MongoDB 4.4, you can specify the output database.
The $out stage must be the last stage in the pipeline. The $out operator lets the aggregation framework return result sets of any size.
$out replaces the specified collection if it exists. See Replace Existing Collection for details.
The $out stage has the following syntax:
-
Starting in MongoDB 4.4,
$outcan take a document to specify the output database as well as the output collection:{ $out: { db: "<output-db>", coll: "<output-collection>" } }Field Description db The output database name.
- For a replica set or a standalone, if the output database does not exist,$outalso creates the database.
- For a sharded cluster, the specified output database must already exist.coll The output collection name. -
$outcan take a string to specify only the output collection (i.e. output to a collection in the same database):{ $out: "<output-collection>" } // Output collection is in the same database