Can’t convert Word Document Equation into HTML readable format text like MathML in C#

  softwareengineering

I have .net C# application in which I am extracting data from word file and save it in database particularly. To store word all data as it is I am converting the whole word file firstly in HTML and then checking html paragraph one by one.
At the time of converting from word file to html my equations which are in the word document file was convert into image.

Globals.ThisAddIn.Application.ActiveDocument.Select();
Microsoft.Office.Interop.Word.Document doc = Globals.ThisAddIn.Application.ActiveDocument;

string result = Path.GetTempPath();

string tmpFileName = Globals.ThisAddIn.Application.ActiveDocument.FullName;
doc.SaveEncoding = Microsoft.Office.Core.MsoEncoding.msoEncodingUSASCII;
if (File.Exists(result + "temp.html"))
{
    File.Delete(result + "temp.html");
}
doc.SaveAs(result + "temp.html", WdSaveFormat.wdFormatFilteredHTML); 

doc.Close(Microsoft.Office.Interop.Word.WdSaveOptions.wdDoNotSaveChanges);

HtmlAgilityPack.HtmlDocument mangledHTML = new HtmlAgilityPack.HtmlDocument();
mangledHTML.Load(result + "temp.html");


if (File.Exists(result + "newtemp.html"))
{
    File.Delete(result + "newtemp.html");
}

mangledHTML.Save(result + "newtemp.html");
// Remove standalone CRLF

string badHTML = File.ReadAllText(result + "newtemp.html");
badHTML = badHTML.Replace("rnrn", "ackThbbtt ");
badHTML = badHTML.Replace("rn", " ");
badHTML = badHTML.Replace("ackThbbtt ", "rn");
badHTML = badHTML.Replace('�', ' ');
if (File.Exists(result + "finaltemp.html"))
{
    File.Delete(result + "finaltemp.html");
}
File.WriteAllText(result + "finaltemp.html", badHTML);

// Clean up temp files, show the finished result in Notepad
File.Delete(result + "temp.html");
File.Delete(result + "newtemp.html");

Microsoft.Office.Interop.Word.Document orignalDoc = new Document();
orignalDoc = Globals.ThisAddIn.Application.Documents.Open(tmpFileName);

This code converts my word documents all equations in Images and as it convert in image I can’t show the equation properly in my application.

So I tried to convert this equations in MATHML form but I couldn’t solve this.

New contributor

Conduct dotnet is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.

LEAVE A COMMENT