Hi Tech folks,
I wanted to get the thumbnail of the first page of Microsoft Office documents(MS word, MS excel, MS ppt,etc) using python.
There are many ways to do it but my requirements is to get thumbnails of million of documents, so i am using Azure Batch Processing using python. So the restriction is that i can't use any software which i have to install or executed explicitly or manually.
I already have a code which converts pdf files to thumbnails so if i can convert office files to pdf then i can achieve my goal but the problem is i am not able to find any python package which can convert to pdf, i also read about COM components which can convert Office docs to pdf but i don't want any installations as it will not fit in my batch processing, if there is any silent installations which can be done using command line then we can do that. Please suggest any alternatives.
What I have tried:
I have tried COM but it does not fit my requirements