There are a few parser scripts around written in PowerShell. They take an HTML file and strip out the markup, leaving just the text. From there you can put the results in an object and pass them through to a SQL Server table.
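To give a rough idea of what such a script does, here is a minimal sketch in Python rather than PowerShell, using only the standard library's `html.parser`. The names `TextExtractor` and `html_to_row` are made up for illustration, and the two-column row shape (source name, plain text) is an assumption about what the target table might look like.

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collects text nodes and ignores tags, scripts, and styles."""

    def __init__(self):
        super().__init__()
        self.parts = []
        self._skip_depth = 0  # > 0 while inside <script> or <style>

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep only non-blank text that is outside script/style blocks.
        if self._skip_depth == 0 and data.strip():
            self.parts.append(data.strip())


def html_to_row(source_name, html):
    """Return a (source_name, plain_text) tuple (hypothetical row shape)."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()  # flush any buffered text
    return (source_name, " ".join(parser.parts))


if __name__ == "__main__":
    sample = "<html><body><h1>Title</h1><p>Hello &amp; bye</p></body></html>"
    print(html_to_row("page.html", sample))
```

The resulting tuple could then be handed to a parameterized INSERT through whatever SQL Server driver you use (pyodbc is one option), which keeps the parsing outside the database.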
I probably wouldn't do this in the database, but if pushed, I'd have a crack at loading the HTML into an XML type and then using XML functions to pull it apart.
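One catch with the XML route is that the HTML has to be well-formed XML, which a lot of real-world pages are not. The sketch below shows the same idea outside the database, in Python with `xml.etree.ElementTree`, as a stand-in for what SQL Server's xml methods such as `nodes()` and `value()` would do. It is not T-SQL, and `extract_cells` and the sample table markup are purely illustrative.

```python
import xml.etree.ElementTree as ET


def extract_cells(xhtml):
    """Parse well-formed (X)HTML and return the text of each table row's cells."""
    root = ET.fromstring(xhtml)  # raises ET.ParseError if not well-formed
    rows = []
    for tr in root.iter("tr"):
        # itertext() gathers text from nested tags such as <b> inside a <td>.
        rows.append(["".join(td.itertext()).strip() for td in tr.findall("td")])
    return rows


if __name__ == "__main__":
    page = (
        "<html><body><table>"
        "<tr><td>1</td><td>Alice</td></tr>"
        "<tr><td>2</td><td><b>Bob</b></td></tr>"
        "</table></body></html>"
    )
    print(extract_cells(page))

    # Typical HTML with an unclosed tag is rejected, just as an XML type would reject it.
    try:
        extract_cells("<p>one<br>two</p>")
    except ET.ParseError as exc:
        print("not well-formed:", exc)
```

If the pages you're dealing with aren't well-formed, you'd need to tidy them first or fall back to the strip-the-tags approach.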