I have transcribed audio with two speakers, I chose the audio identification option but the json produced just as the tea without speakers and then a whole bunch of data on timings of each speaker but not associated to the text.
For me this renders is less than ideal.
You are not logged in. Log in to post an answer.
A good answer clearly answers the question and provides constructive feedback and encourages professional growth in the question asker.