简体   繁体   中英

Remove from list where all items not included using LINQ

I have a generic list ItemList where if each ListID doesn't contain all possible ItemIDs , then I'd like to remove them from the list.

I'll illustrate below with a cut down version of the data:

MaxItemID = 4

ListID       ItemID       

   1            1       
   2            1 
   2            2 
   2            3
   2            4
   3            1
   3            2 
   4            1
   4            2
   4            3
   4            4
   5            1
   5            2

And the data I'd like out of that would be:

ListID       ItemID       

   2            1 
   2            2 
   2            3
   2            4
   4            1
   4            2
   4            3
   4            4

I asked a similar question yesterday relating just to SQL, and got an answer of using Exists , which was good, but I've since discovered that the maximum number of items could vary, and also I think I'd prefer to use linq. I thought using a distinct().Count() technique could work, but had no success so far, and it sounds like it's not the most efficient way either.

Thanks very much

I hope this is your purpose:

var list = 
     ItemList
    .GroupBy(p=>p.ListID)
    .Where(p=>p.Distinct().Count()<MaxItemID)
    .ToArray()
int distinctIDs = items.Select(x => x.ItemID).Distinct().Count();
var newList = items
    .GroupBy(x => x.ListID)
    .Where( x => x.Select(y=>y.ItemID).Distinct().Count() ==  distinctIDs)
    .SelectMany(x => x.ToList())
    .ToList();

Try this

var query = from baseItem in itemList
    group baseItem by baseItem.ListID into gr
    where gr.Count() == MaxItemID
    join selectItem in itemList on gr.Key equals selectItem.ListID
    select selectItem;

var requiredList = query.ToList();

This works, check the screen shot. 在此处输入图片说明

EDIT

If duplicates are allowed then query can be modified as

var query = from item in itemList
    group item by item.ListID into gr
    where gr.Select(s=>s.ItemId).Distinct().Count() == MaxItemID
    join i in itemList on gr.Key equals i.ListID
    select i;

Cannot comment on the performance of the Linq version but here are Linq and non-linq solutions

hth,
Alan

Edit

The sample data does show any duplicates of listId/ItemId so I omitted the distinct check but if needed it goes as the first part of the linq statement.

Edit 2

Have updated the Linq code.
Haven't updated the non-linq code. Just check to see if same a previous before adding (assumes sorted, ListId, ItemId as per sample input.

[TestMethod]
public void LinqVersion() {

    Dictionary<int, List<ListItem>> found = new Dictionary<int, List<ListItem>>();

    var comparer = new ListItemEqualityComparer();

    var actual = Input.Distinct(comparer).GroupBy(lstItem => lstItem.ListId).Where(grp => grp.Count() == 4).SelectMany(grp => grp);

    Assert.IsTrue(Expected.SequenceEqual(actual, comparer));

}


[TestMethod]
public void NoLinqVersion() {

    Dictionary<int, List<ListItem>> found = new Dictionary<int, List<ListItem>>();

    foreach (var listItem in Input) {
      AddItem(found, listItem.ListId, listItem);
     }

    var actual = new List<ListItem>();
    foreach (var pair in found) {
       if (pair.Value.Count == 4) {
           actual.AddRange(pair.Value);
       }
    }

    Assert.IsTrue(Expected.SequenceEqual(actual, new ListItemEqualityComparer()));

}

    private static void AddItem(IDictionary<int, List<ListItem>> dictionary, int listId, ListItem listItem) {
        if (!dictionary.ContainsKey(listId)) {
            dictionary.Add(listId, new List<ListItem>());
        }
        dictionary[listId].Add(listItem);
    }

    public class ListItem {
        public int ListId { get; set; }
        public int ItemId { get; set; }
    }

    public class ListItemEqualityComparer : IEqualityComparer<ListItem> {
        public bool Equals(ListItem x, ListItem y) {
            return x.ListId == y.ListId && x.ItemId == y.ItemId;
        }

        public int GetHashCode(ListItem obj) {
            return obj.ListId ^ obj.ItemId;
        }
    }

    public List<ListItem> Input = new List<ListItem>(){
        new ListItem{ ListId = 1 , ItemId = 1},
        new ListItem{ ListId = 2 , ItemId = 1},
        new ListItem{ ListId = 2 , ItemId = 2},
        new ListItem{ ListId = 2 , ItemId = 3},
        new ListItem{ ListId = 2 , ItemId = 4},
        new ListItem{ ListId = 3 , ItemId = 1},
        new ListItem{ ListId = 3 , ItemId = 2},
        new ListItem{ ListId = 4 , ItemId = 1},
        new ListItem{ ListId = 4 , ItemId = 2},
        new ListItem{ ListId = 4 , ItemId = 3},
        new ListItem{ ListId = 4 , ItemId = 4},
        new ListItem{ ListId = 5 , ItemId = 1},
        new ListItem{ ListId = 5 , ItemId = 2},
    };


    public List<ListItem> Expected = new List<ListItem>{
        new ListItem{ ListId = 2 , ItemId = 1},
        new ListItem{ ListId = 2 , ItemId = 2},
        new ListItem{ ListId = 2 , ItemId = 3},
        new ListItem{ ListId = 2 , ItemId = 4},
        new ListItem{ ListId = 4 , ItemId = 1},
        new ListItem{ ListId = 4 , ItemId = 2},
        new ListItem{ ListId = 4 , ItemId = 3},
        new ListItem{ ListId = 4 , ItemId = 4}
    };

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM