We get audiofiles of medical lectures. We send these out to be transcribed and then figure out at which time points in the transcript slides were changed. We then mark this on the transcript.
We then need to go back and listen to the audiofile to locate the corresponding (20-60) time points on the audiofile, which takes needless time.
What I need is a program that matches up the audiofile with the transcript to come up with the time points marked on the transcript.
For example: [url removed, login to view] = audiofile
[url removed, login to view] = marked up transcript:
xxxxxx xxxxxxxxx xxxxxxxxxxx xxxxxxxxxxx xxxxxxx
xxxxxxxxx xxxxxxxxx xxxxxx xxxxxxxx
xxxxxxxx xxxxxxxxx xxxxxxxxx xxxxxxx xxxxxxxx
xxxxxxx xxxxxxxx xxxxxxxxx xxxxxxxxxx xxxxxx
xxxxxx xxxxxxxx xxxxxxxxxx xxxxxxxx xxxx
xxxxxxxxx xxxxxxxxxx xxxxxxxxx xxxxxx
I need a program to read [url removed, login to view] and [url removed, login to view] and output:
slide 1 3:54
slide 2 6:23
slide 60 34:40
Ideally we would like to have the output in whatever script file we are using to control the audiofile, but this should be a detail. Also it would be nice to be able to manually verify and edit the machine-selected time points, either globally (subtract x or y milliseconds from all time points), or point by point.
This one is a challenge! Some of the audiofiles are not of great quality, have foregin speakers, etc. So a natural language recognition approach may or may not work. I leave it up to you!
1) Complete and fully-functional working program(s) in executable form as well as complete source code of all work done.
2) Deliverables must be in ready-to-run condition, as follows (depending on the nature of the deliverables):
a) For web sites or other server-side deliverables intended to only ever exist in one place in the Buyer's environment--Deliverables must be installed by the Seller in ready-to-run condition in the Buyer's environment.
b) For all others including desktop software or software the buyer intends to distribute: A software installation package that will install the software in ready-to-run condition on the platform(s) specified in this bid request.
3) All deliverables will be considered "work made for hire" under U.S. Copyright law. Buyer will receive exclusive and complete copyrights to all work purchased. (No GPL, GNU, 3rd party components, etc. unless all copyright ramifications are explained AND AGREED TO by the buyer on the site per the coder's Seller Legal Agreement).
Output can be an executable file that will work either on Windows or UNIX.