简体   繁体   中英

Fit multivariate gaussian distribution to a given dataset

I need to fit multivariate gaussian distribution ie obtain mean vector and covariance matrix of the nearest multivariate gaussian for a given dataset of audio features in python. The audio features (MFCC coefficients) are a NX 13 matrix where N is around 4K. Can someone please outline the packages and technique to fit the gaussian for this data in python?

Use the numpy package. numpy.mean and numpy.cov will give you the Gaussian parameter estimates. Assuming that you have 13 attributes and N is the number of observations, you will need to set rowvar=0 when calling numpy.cov for your N x 13 matrix (or pass the transpose of your matrix as the function argument).

If your data are in numpy array data :

mean = np.mean(data, axis=0)
cov = np.cov(data, rowvar=0)

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM