MongoDB, PyMongo: how to filter results by count of a unique field?

Question

MongoDB contains the following set of data:

[{"user": "a", "domain": "some.com"},
{"user": "b", "domain": "some.com"},
{"user": "b1", "domain": "some.com"},
{"user": "c", "domain": "test.com"},
{"user": "d", "domain": "work.com"},
{"user": "aaa", "domain": "work.com"},
{"user": "some user", "domain": "work.com"} ]

I need to select the first items for each domain, with no more than 2 documents per domain in the result. After the query, the result should look like this:

[{"user": "a", "domain": "some.com"},
{"user": "b", "domain": "some.com"},
{"user": "c", "domain": "test.com"},
{"user": "d", "domain": "work.com"},
{"user": "aaa", "domain": "work.com"}]

Only 2 results per domain should be returned; any further documents with the same domain must be skipped. Is this possible with an aggregation, $filter, or something else?

Is there a way to group by domain and get just the first N (2 in this example) users? Example:

[{"domain": "some.com", "users": ["a", "b"]}]

so

{"user": "b1", "domain": "some.com"} will be skipped.

Answer 1

You can get the desired result with a MongoDB aggregation.

It consists of four stages:
1. Group by the domain field and accumulate the documents that share a domain into a data array.
2. Then slice the array to keep at most 2 items per domain.
3. Flatten the data field with the $unwind operator.
4. Restore the original document structure with the $replaceRoot operator.

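db.collection.aggregate([
  // 1. Collect all documents of each domain into a "data" array
  {
    "$group": {
      "_id": "$domain",
      "data": { "$push": "$$ROOT" }
    }
  },
  // 2. Keep only the first 2 documents per domain
  {
    "$addFields": {
      "data": {
        "$slice": [ "$data", 0, 2 ]
      }
    }
  },
  // 3. Emit one output document per array element
  {
    "$unwind": "$data"
  },
  // 4. Promote the embedded document back to the top level
  {
    "$replaceRoot": { "newRoot": "$data" }
  }
])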
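
Since the question asks about PyMongo, here is a rough sketch of the same idea in Python. The pipeline is the one above written as a Python list; with PyMongo it would normally be passed to `aggregate()` on a collection handle (the name `collection` in the comment is only a placeholder, and no server is contacted here). The helper `first_n_per_domain` is a hypothetical pure-Python emulation, added only to illustrate what the four stages do to the sample data. It is not part of MongoDB or PyMongo.

```python
# The aggregation pipeline from the answer, as a plain Python list.
# With PyMongo this would be run as: collection.aggregate(pipeline)
# ("collection" is a placeholder for your own collection object).
pipeline = [
    {"$group": {"_id": "$domain", "data": {"$push": "$$ROOT"}}},
    {"$addFields": {"data": {"$slice": ["$data", 0, 2]}}},
    {"$unwind": "$data"},
    {"$replaceRoot": {"newRoot": "$data"}},
]

# Sample documents from the question.
docs = [
    {"user": "a", "domain": "some.com"},
    {"user": "b", "domain": "some.com"},
    {"user": "b1", "domain": "some.com"},
    {"user": "c", "domain": "test.com"},
    {"user": "d", "domain": "work.com"},
    {"user": "aaa", "domain": "work.com"},
    {"user": "some user", "domain": "work.com"},
]


def first_n_per_domain(documents, n=2):
    """Hypothetical in-memory emulation of the four pipeline stages."""
    groups = {}
    for doc in documents:
        # Stage 1 ($group / $push): collect documents per domain.
        groups.setdefault(doc["domain"], []).append(doc)
    result = []
    for data in groups.values():
        # Stage 2 ($slice) keeps the first n items; stages 3 and 4
        # ($unwind, $replaceRoot) flatten them back into plain documents.
        result.extend(data[:n])
    return result


limited = first_n_per_domain(docs)
for doc in limited:
    print(doc)
```

One caveat: the emulation returns domains in first-seen order, whereas MongoDB's $group stage does not guarantee the order of its output documents. If the order of the final result matters, adding a $sort stage at the end of the pipeline is one option.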

MongoPlayground | PyMongo Aggregation

Answer 1 (accepted), 2020-04-03 15:48:08
