
Python convert (read & save) excel xlsx to xls

How can I convert an existing xlsx Excel file into xls while retaining my Excel file formatting? I use Anaconda Python 3, so I'm not sure I can use xlutils. I was not able to install it via conda install xlutils because of lots of incompatibilities. So now I use this code without the xlutils.copy():

import xlrd, xlwt

wb = xlrd.open_workbook(my_xlsx_excel_file)
# wb = xlutils.copy(wb)
wb.save(my_xlsx_excel_file[:-1])

And I get this error:

AttributeError: 'Book' object has no attribute 'save'

Thank you!

First things first: Why do you want to convert to .xls? This is usually a sign that you are using outdated tools somewhere in the process, and it might be better to use newer tools rather than convert the data to an older format.

But, if you really need to convert to .xls while preserving formatting, your only realistic choice at this time is to use Excel itself. You didn't say which platform you are using, but if it's Windows or Mac, and you have Excel installed, then the most straightforward way to automate Excel is probably xlwings. In principle this will allow you to use Python to open the .xlsx file in Excel (an actual, running instance of Microsoft Excel) and do "save as" to a .xls file.

I say "in principle" because I don't personally know how to do it in xlwings. (I don't really use that package.) Under the covers, xlwings is relying on pywin32 on Windows and appscript on Mac, so you could use those lower-level packages directly.

For example, if you are on Windows, you could do this:

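import os
from win32com.client import Dispatch

# Excel resolves relative paths against its own working directory, so use an absolute path.
src = os.path.abspath(my_xlsx_excel_file)

xl = Dispatch('Excel.Application')
wb = xl.Workbooks.Open(src)         # open the existing workbook
wb.SaveAs(src[:-1], FileFormat=56)  # "file.xlsx"[:-1] == "file.xls"
wb.Close(False)                     # close without prompting to save again
xl.Quit()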

The 56 is the numeric value of xlExcel8, the constant in Excel's XlFileFormat enumeration for the Excel 97-2003 format (on Windows).
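
Rather than hard-coding the number, one might look the constant up from the target file's extension. The following is a minimal sketch: the numeric values are taken from Excel's XlFileFormat enumeration, while the dictionary and the helper name file_format_for are illustrative and not part of pywin32.

```python
from pathlib import Path

# A few values from Excel's XlFileFormat enumeration (Windows).
FILE_FORMAT_BY_EXT = {
    ".xls": 56,   # xlExcel8: Excel 97-2003 workbook
    ".xlsx": 51,  # xlOpenXMLWorkbook
    ".xlsm": 52,  # xlOpenXMLWorkbookMacroEnabled
    ".xlsb": 50,  # xlExcel12: binary workbook
}


def file_format_for(path):
    """Return the XlFileFormat value matching the extension of `path`."""
    ext = Path(path).suffix.lower()
    try:
        return FILE_FORMAT_BY_EXT[ext]
    except KeyError:
        raise ValueError(f"no known Excel file format for {ext!r}") from None
```

With this in place, the save call could be written as wb.SaveAs(out_path, FileFormat=file_format_for(out_path)).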

Naturally, there should be a corresponding way to do this on a Mac with appscript. Just be aware that the file format constants may be different than on Windows.
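
For the xlwings route mentioned at the start of this answer, something along these lines might work on Windows. Treat it as an untested sketch: it assumes xlwings and Excel are installed, it uses wb.api to reach the underlying pywin32 workbook object so that SaveAs can be called with the same constant as in the pywin32 example, and the helper names xls_path_for and convert_with_xlwings are made up for illustration.

```python
import os
from pathlib import Path

XL_EXCEL8 = 56  # Excel 97-2003 format, same constant as in the pywin32 example


def xls_path_for(xlsx_path):
    """Return the absolute path of the .xls file next to `xlsx_path`."""
    return Path(os.path.abspath(xlsx_path)).with_suffix(".xls")


def convert_with_xlwings(xlsx_path):
    """Open the workbook in a hidden Excel instance and save it as .xls (Windows only)."""
    import xlwings as xw  # imported lazily: needs xlwings plus a local Excel install

    src = Path(os.path.abspath(xlsx_path))
    dst = xls_path_for(src)
    app = xw.App(visible=False)
    try:
        wb = app.books.open(str(src))
        # On Windows, wb.api is the native COM workbook, so SaveAs accepts FileFormat.
        wb.api.SaveAs(str(dst), FileFormat=XL_EXCEL8)
        wb.close()
    finally:
        app.quit()  # always shut down the hidden Excel instance
    return dst
```

Using with_suffix avoids relying on the [:-1] slice, which only gives the right name when the path ends in exactly .xlsx.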

Another solution would be to use subprocess to run Excel via PowerShell. This has the advantage of not needing Windows-specific Python libraries, so it can work from WSL as well.

import subprocess
import textwrap
import os

xlsx = 'data.xlsx'
ps = 'script.ps1'

with open(ps, 'w') as f:
    f.write(textwrap.dedent('''\
        param ($File)
        $myDir = split-path -parent $MyInvocation.MyCommand.Path
        $excelFile = "$myDir\\" + $File
        $Excel = New-Object -ComObject Excel.Application
        $wb = $Excel.Workbooks.Open($excelFile)
        $out = "$myDir\\" + (Get-Item ("$myDir\\" + $File) ).Basename + ".xls"
        $wb.SaveAs($out, 56)
        $Excel.Quit()
    '''))
p = subprocess.Popen(["powershell.exe", '.\\'+ps, xlsx])
p.communicate()
os.remove(ps)
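
If the script is blocked by PowerShell's execution policy, or if a non-zero exit status should raise an exception in Python, a variation like the following may help. It is a sketch under assumptions: the helper names are invented here, and the script is assumed to sit in the current directory next to the workbook, as in the code above. The -NoProfile, -ExecutionPolicy and -File options are standard powershell.exe options.

```python
import subprocess


def build_powershell_command(script_name, xlsx_name):
    """Build the argument list for running a .ps1 script with one file argument."""
    return [
        "powershell.exe",
        "-NoProfile",                  # skip user profile scripts
        "-ExecutionPolicy", "Bypass",  # let this unsigned local script run
        "-File", ".\\" + script_name,
        xlsx_name,
    ]


def run_conversion(script_name, xlsx_name):
    """Run the script and raise CalledProcessError if PowerShell exits non-zero."""
    subprocess.run(build_powershell_command(script_name, xlsx_name), check=True)
```

Calling run_conversion(ps, xlsx) would then take the place of the Popen and communicate lines.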

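You can try openpyxl. Install it with conda install openpyxl; it should work with Python 3.5. The following code might then work, although openpyxl only writes the xlsx family of formats, so the result is likely to be xlsx data saved under an .xls name rather than a true Excel 97-2003 file: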

import openpyxl as xl
wb = xl.load_workbook("yourfile.xlsx")
wb.save("file.xls")
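
One way to check what a call like wb.save("file.xls") actually produced is to look at the first bytes of the file: genuine .xls files are OLE2 compound documents, while xlsx files are ZIP archives. A minimal sketch using only the standard library; the function name sniff_excel_format is made up for this example, and the ZIP check is a heuristic, since any ZIP archive starts the same way.

```python
OLE2_MAGIC = bytes.fromhex("D0CF11E0A1B11AE1")  # compound file header used by .xls
ZIP_MAGIC = b"PK\x03\x04"  # ZIP local file header, as found at the start of .xlsx


def sniff_excel_format(path):
    """Return 'xls', 'xlsx' or 'unknown' based on the file's leading bytes."""
    with open(path, "rb") as f:
        head = f.read(8)
    if head.startswith(OLE2_MAGIC):
        return "xls"
    if head.startswith(ZIP_MAGIC):
        return "xlsx"  # heuristic: only proves the file is a ZIP archive
    return "unknown"
```

If sniff_excel_format("file.xls") reports 'xlsx', the file was renamed rather than converted.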

You can learn more from the openpyxl documentation: https://openpyxl.readthedocs.io/en/default/

Enjoy!
