I encountered a strange behavior of mongo and I would like to clarify it a bit...
My request is simple as that: I would like to get a size of single document in collection. I found two possible solutions:
Here, I provide some code I perform testing on:
I created new database 'test' and input simple document with only one attribute: type:"auto"
db.test.insert({type:"auto"})
output from stats() function call: db.test.stats() :
{ "ns" : "test.test", "count" : 1, "size" : 40, "avgObjSize" : 40, "storageSize" : 4096, "numExtents" : 1, "nindexes" : 1, "lastExtentSize" : 4096, "paddingFactor" : 1, "systemFlags" : 1, "userFlags" : 0, "totalIndexSize" : 8176, "indexSizes" : { "_id_" : 8176 }, "ok" : 1
}
output from bsonsize function call: Object.bsonsize(db.test.find({test:"auto"}))
481
In the previous call of Object.bsonsize()
, Mongodb returned the size of the cursor, rather than the document.
Correct way is to use this command:
Object.bsonsize(db.test.findOne())
With findOne()
, you can define your query for a specific document:
Object.bsonsize(db.test.findOne({type:"auto"}))
This will return the correct size (in bytes) of the particular document.
I recommended to use this script to get the real size.
db.users.find().forEach(function(obj)
{
var size = Object.bsonsize(obj);
print('_id: '+obj._id+' || Size: '+size+'B -> '+Math.round(size/(1024))+'KB -> '+Math.round(size/(1024*1024))+'MB (max 16MB)');
});
Note: If your IDs are 64-bit integers, the above will truncate the ID value on printing! If that's the case, you can use instead:
db.users.find().forEach(function(obj)
{
var size = Object.bsonsize(obj);
var stats =
{
'_id': obj._id,
'bytes': size,
'KB': Math.round(size/(1024)),
'MB': Math.round(size/(1024*1024))
};
print(stats);
});
This also has the advantage of returning JSON, so a GUI like RoboMongo can tabulate it!
source : https://stackoverflow.com/a/16957505/3933634
edit : thanks to @zAlbee for your suggest completion.
The effective amount of space the document will take in the collection will be more than the size of your document because of the Record Padding mechanism.
This is why there is a difference between the outputs of the db.test.stats()
and Object.bsonsize(..)
.
To get the exact size (in bytes) of the document, stick to the Object.bsonsize()
function.
With mongodb 4.4 (upcoming), You can use bsonSize
operator to get the document size.
db.test.aggregate([
{
"$project": {
"name": 1,
"object_size": { "$bsonSize": "$$ROOT" }
}
}
])
Object.bsonsize(db.test.findOne({type:"auto"})) 它以字节为单位给出。
Method Object.bsonsize()
is available only in legacy mongo
shell. In new mongosh
you have to use package bson
const BSON = require("bson");
BSON.calculateObjectSize({field: "value"})
BSON.calculateObjectSize(db.test.findOne())
The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.