删除列表<t>多次出现的元素，就地</t>

Question

发布了一个类似的问题，但我没有代表在该线程中提出后续问题。 :(

如果我的List<T>包含多次出现的项， List.Distinct()将删除重复项，但原始项仍将保留在原处。 如果我想删除多次出现的项目，包括原始项目，对原始列表执行此操作的最有效方法是什么？

给定一个名为oneTime的List<int> ：

{ 4, 5, 7, 3, 5, 4, 2, 4 }

所需的 output 将在oneTime中：

{ 7, 3, 2 }

跟进@Enigmativity 的问题：

这是我的脚本正在执行的操作的伪版本。 它是在运行于 .NET3.5 上的 NinjaTrader 中完成的。

我将附上代码应该做什么的一般概念，我会附上实际的脚本，但除非使用 NinjaTrader，否则它可能没有用。

但本质上，有一个大的 z 循环。 每次通过时，一系列数字都会添加到“LiTics”中。 我不想打扰。 然后我将该列表传递给 function，并返回只出现一次的值列表。 然后我想在循环中每次都看到这些数字。

它最初工作，但在各种数据集上运行它，在循环几次之后，它开始报告多次出现的值。 我不确定为什么？

for(int z=1; z<=10000; z +=1)//Runs many times 
{ 
    if (BarsInProgress ==0 &&CurrentBar-oBarTF1>0 &&startScript )   //Some Condition
    {
        for(double k=Low[0]; k<=High[0]; k +=TickSize)  
        {   
            LiTics.Add(k);  
            //Adds a series of numbers to this list each time through z loop
            //This is original that I do not want to disturb
        }

        LiTZ.Clear();  //Display list to show me results Clear before populating
        LiTZ=GetTZone(LiTics); //function created in thread(below)
                               //Passing the undisturbed list that is modified on every loop
        foreach (double prime in LiTZ) { Print(Times[0] +",  " +prime);  }
        //Printing to see results   
    }

}//End of bigger 'z' loop

//Function created to get values that appear ONLY once
public List<double> GetTZone(List<double> sequence) 
{  
    var result =
        sequence
            .GroupBy(x => x)
            .Where(x => !x.Skip(1).Any())
            .Select(x => x.Key)
            .ToList();
    return result;
}

打印出来的图片和出了什么问题：屏幕截图。

Answer 1

因此，如果您可以拥有一个新列表，那么这是最简单的方法：

var source = new List<int>() { 4, 5, 7, 3, 5, 4, 2, 4 };

var result =
    source
        .GroupBy(x => x)
        .Where(x => !x.Skip(1).Any())
        .Select(x => x.Key)
        .ToList();

这给出：

{ 7, 3, 2 }

如果要从原始源中删除值，请执行以下操作：

var duplicates =
    new HashSet<int>(
        source
            .GroupBy(x => x)
            .Where(x => x.Skip(1).Any())
            .Select(x => x.Key));

source.RemoveAll(n => duplicates.Contains(n));

Answer 2

下面是List<T> class 的扩展方法，它从列表中删除出现不止一次的所有项目：

/// <summary>
/// Removes all the elements that have a key that appears more than once,
/// according to a specified key selector function.
/// </summary>
public static int RemoveDuplicatesByKey<TSource, TKey>(this List<TSource> list,
    Func<TSource, TKey> keySelector,
    IEqualityComparer<TKey> comparer = default)
{
    ArgumentNullException.ThrowIfNull(list);
    ArgumentNullException.ThrowIfNull(keySelector);
    Dictionary<TKey, int> occurences = new(list.Count, comparer);
    foreach (TSource item in list)
        CollectionsMarshal.GetValueRefOrAddDefault(
            occurences, keySelector(item), out _)++;
    return list.RemoveAll(item => occurences[keySelector(item)] > 1);
}

使用CollectionsMarshal.GetValueRefOrAddDefault方法 (.NET 6) 来计算每个元素的出现次数Dictionary<TKey, int>以提高效率。

使用示例：

List<int> list = new() {4, 5, 7, 3, 5, 4, 2, 4};
Console.WriteLine($"Before: [{String.Join(", ", list)}]");
int removedCount = list.RemoveDuplicatesByKey(x => x);
Console.WriteLine($"After: [{String.Join(", ", list)}], Removed: {removedCount}");

Output：

Before: [4, 5, 7, 3, 5, 4, 2, 4]
After: [7, 3, 2], Removed: 5

在线演示。

Answer 3

我为您提供两种选择，一种使用HashSet ，另一种使用Linq 。

选项1：

使用HashSet ，循环遍历集合，如果不存在则插入，如果存在则删除。

HashSet<int> hash = new HashSet<int>();

foreach(var number in list)
{
    if(!hash.Contains(number)) hash.Add(number);
    else hash.Remove(number);               
}
list = hash.ToList();

选项2：

简单的Linq,对元素进行分组并过滤计数>1的元素。

var list= list.GroupBy(g=>g)
    .Where(e=>e.Count()==1)
    .Select(g=>g.Key)
    .ToList();

存在使用大的性能增益HashSet超过Linq ，很明显， Linq （在这种情况下）需要多次迭代，其中作为HashSet使用单次迭代，并且提供查找（用于添加/删除）与O(1)的访问。

Elapsed Time (Using Linq): 8808 Ticks
Elapsed Time (Using HashSet): 51 Ticks

工作Demo

删除列表<t>多次出现的元素，就地</t>

问题描述

3 个解决方案

解决方案1
5 已采纳 2016-04-05 03:00:56

解决方案2
0 2023-01-07 02:20:18

解决方案3
-1 2016-04-05 02:20:44

删除列表<t>多次出现的元素，就地</t>

问题描述

3 个解决方案

解决方案1 5 已采纳 2016-04-05 03:00:56

解决方案2 0 2023-01-07 02:20:18

解决方案3 -1 2016-04-05 02:20:44

解决方案1
5 已采纳 2016-04-05 03:00:56

解决方案2
0 2023-01-07 02:20:18

解决方案3
-1 2016-04-05 02:20:44