Hello @vinit sawant , I hope below information would be helpful for your initial query.
Speech Identification verify and identify speakers by their unique voice characteristics, by using voice biometry. You provide audio training data for a single speaker, which creates an enrollment profile based on the unique characteristics of the speaker's voice. You can then cross-check audio voice samples against this profile to verify that the speaker is the same person (speaker verification). You can also cross-check audio voice samples against a group of enrolled speaker profiles to see if it matches any profile in the group (speaker identification). Then you do recognition, it will recognize based on the file.
Characteristics somehow like nose, eye, lip,... in face recognition.