Any idea how I can go about implementing an algorithm for identifying a song by matching it against a clip of the song, a human voice, or a hummed tune? I don't expect anyone to write out the algorithm here :). I know it involves a lot of pieces working together, but I would like to know how to get started. What technologies will I need to consider for something like this? I am looking at something similar to Shazam or Midomi. Also, is there an online service that does the same and provides an API for it?
Oh, it's simple.
Split the song into short pieces, run an FFT on each, pick out the few strongest frequency magnitudes, and store them as a hash together with their time position in the song.
Then do the same with the recorded audio and match its hashes against the stored data, checking that the time offsets line up.
Simple, isn't it? To be honest, it's more complicated than that, but the idea is similar.
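To make the idea a bit more concrete, here is a rough Python sketch of those steps using NumPy. It is only an illustration under some assumptions of mine, not how Shazam or Midomi actually work: the frame size, hop size, and number of peaks are arbitrary values, the names `fingerprint`, `build_index`, `match`, and `tones` are made up for this example, and the "songs" are synthetic sine tones. The hash here is simply the tuple of the strongest FFT bins in a frame, so it only copes with clean clips that line up with the frame grid. A real recording would need something more robust to noise and misalignment.

```python
from collections import Counter, defaultdict

import numpy as np

FRAME = 1024  # samples per FFT frame (assumed value)
HOP = 512     # step between consecutive frames (assumed value)
N_PEAKS = 3   # strongest frequency bins kept per frame (assumed value)


def fingerprint(samples):
    """Return a list of (hash, frame_index) pairs for a mono signal."""
    window = np.hanning(FRAME)
    prints = []
    for t, start in enumerate(range(0, len(samples) - FRAME + 1, HOP)):
        spectrum = np.abs(np.fft.rfft(samples[start:start + FRAME] * window))
        strongest = np.argsort(spectrum)[-N_PEAKS:]
        # The hash is just the sorted tuple of the strongest bin indices.
        prints.append((tuple(sorted(int(b) for b in strongest)), t))
    return prints


def build_index(songs):
    """Map each hash to every (song_name, frame_index) where it occurs."""
    index = defaultdict(list)
    for name, samples in songs.items():
        for h, t in fingerprint(samples):
            index[h].append((name, t))
    return index


def match(index, clip):
    """Vote for (song, time offset); return the best (song, offset, votes)."""
    votes = Counter()
    for h, t_clip in fingerprint(clip):
        for name, t_song in index.get(h, ()):
            # Hashes from the right song agree on one consistent offset.
            votes[(name, t_song - t_clip)] += 1
    if not votes:
        return None
    (name, offset), count = votes.most_common(1)[0]
    return name, offset, count


def tones(freqs, rate=8000, note_seconds=0.25):
    """Hypothetical test signal: a sequence of pure sine notes."""
    t = np.arange(int(rate * note_seconds)) / rate
    return np.concatenate([np.sin(2 * np.pi * f * t) for f in freqs])


songs = {
    "song_a": tones([440, 660, 880, 550, 990, 330, 770, 1100]),
    "song_b": tones([300, 500, 700, 900, 400, 600, 800, 1000]),
}
index = build_index(songs)

# Cut a clip out of song_a, starting exactly on a frame boundary.
clip = songs["song_a"][8 * HOP:8 * HOP + 6000]
print(match(index, clip))
```

The important part is the voting in `match`: a single hash can show up in many places, but only the correct song produces many hits that all agree on the same time offset. That is what "match with regards to time" means in practice.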