What features of audio do I need to extract in recognizing phonemes? And what algorithms are best to use in extracting these?

I don't have any experience specific to this area but a search over at IEEE Explore turned up quite a few results. You can read the abstracts and usually find the author's website if something interests you (the articles without a subscription are costly, unfortunately).

Be a part of the DaniWeb community

We're a friendly, industry-focused community of developers, IT pros, digital marketers, and technology enthusiasts meeting, networking, learning, and sharing knowledge.