简体繁体中英

How I do I get a second sample from a dataset in Python without getting duplication from a first sample?

原文 2021-11-24 10:44:37 4 1 python/ random/ dataset

I have a python dataset that I have managed to take a sample from and put in a second dataset. After that I will need to produce another sample from the original dataset but I do not want any of the first sample to come up again. Ideally this would need any flag would only be there for a year so it can then be sampled again after that time has elapsed.

1 answers

Denote your original dataset with A. You generate a subset of A, denote it with B1. You can then create B2 from A_leftover = A \ B1, where \ denotes the set difference. You can then generate B3, B4, ... B12 from A_leftover, where Bi is generated from A_leftover = B(i-1).

If you want to put back B1 in the next year, A_leftover = A_leftover \ B12 U B1, and from this, you can generate the subset for B13 (or you can denote it with B1 as 13%12 = 1). So after 12, you can say you can generate Bi from A_leftover = A_leftover \ B(i-1) UB(i-11). Or you can use this formula from the very beginning, defining B(-i) = empty set for every i in [0,1,2,...,10].

python: how do I randomly sample a number of samples from a population?

How do I filter out random sample from dataframe in which there are different sample size for each value, in python?

how to sample from a dataset and get the indices of samples in initial dataset

How do I construct a random sample from a subdivided population?

How do I sample a certain number of rows in a database table from a python script?

How to do i print some numbers using .sample() from the random built in module in python

How do I remove brackets, quotes and commas from the output of random.sample (discord bot made with python)

How do I randomly sample from a list in python while maintaining the distribution of data

How to run a sample code which I get as input from Python GUI?

How can I sample 10% of a dataset in Tensorflow?

暂无

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

Related Question python: how do I randomly sample a number of samples from a population? How do I filter out random sample from dataframe in which there are different sample size for each value, in python? how to sample from a dataset and get the indices of samples in initial dataset How do I construct a random sample from a subdivided population? How do I sample a certain number of rows in a database table from a python script? How to do i print some numbers using .sample() from the random built in module in python How do I remove brackets, quotes and commas from the output of random.sample (discord bot made with python) How do I randomly sample from a list in python while maintaining the distribution of data How to run a sample code which I get as input from Python GUI? How can I sample 10% of a dataset in Tensorflow?

Related Tags

How I do I get a second sample from a dataset in Python without getting duplication from a first sample?

Question

1 answers

solution1 0 2021-11-26 22:03:41

solution1
0 2021-11-26 22:03:41