Format / Suppress Scientific Notation from Pandas Aggregation Results

Question

How can one modify the format for the output from a groupby operation in pandas that produces scientific notation for very large numbers?

I know how to do string formatting in python but I'm at a loss when it comes to applying it here.

df1.groupby('dept')['data1'].sum()

dept
value1       1.192433e+08
value2       1.293066e+08
value3       1.077142e+08

This suppresses the scientific notation if I convert to string but now I'm just wondering how to string format and add decimals.

sum_sales_dept.astype(str)

Answer 1

Granted, the answer I linked in the comments is not very helpful. You can specify your own string converter like so.

In [25]: pd.set_option('display.float_format', lambda x: '%.3f' % x)

In [28]: Series(np.random.randn(3))*1000000000
Out[28]: 
0    -757322420.605
1   -1436160588.997
2   -1235116117.064
dtype: float64

I'm not sure if that's the preferred way to do this, but it works.

Converting numbers to strings purely for aesthetic purposes seems like a bad idea, but if you have a good reason, this is one way:

In [6]: Series(np.random.randn(3)).apply(lambda x: '%.3f' % x)
Out[6]: 
0     0.026
1    -0.482
2    -0.694
dtype: object

Answer 2

Here is another way of doing it, similar to Dan Allan's answer but without the lambda function:

>>> pd.options.display.float_format = '{:.2f}'.format
>>> Series(np.random.randn(3))
0    0.41
1    0.99
2    0.10

or

>>> pd.set_option('display.float_format', '{:.2f}'.format)

Answer 3

You can use round function just to suppress scientific notation for specific dataframe:

df1.round(4)

or you can suppress is globally by:

pd.options.display.float_format = '{:.4f}'.format

Answer 4

If you want to style the output of a data frame in a jupyter notebook cell, you can set the display style on a per-dataframe basis:

df = pd.DataFrame({'A': np.random.randn(4)*1e7})
df.style.format("{:.1f}")

See the documentation here .

Answer 5

Setting a fixed number of decimal places globally is often a bad idea since it is unlikely that it will be an appropriate number of decimal places for all of your various data that you will display regardless of magnitude. Instead, try this which will give you scientific notation only for large and very small values (and adds a thousands separator unless you omit the ","):

pd.set_option('display.float_format', lambda x: '%,g' % x)

Or to almost completely suppress scientific notation without losing precision, try this:

pd.set_option('display.float_format', str)

Answer 6

I had multiple dataframes with different floating point, so thx to Allans idea made dynamic length.

pd.set_option('display.float_format', lambda x: f'%.{len(str(x%1))-2}f' % x)

The minus of this is that if You have last 0 in float, it will cut it. So it will be not 0.000070, but 0.00007.

Answer 7

如果您想使用这些值，例如作为 csvfile csv.writer 的一部分，可以在创建列表之前对数字进行格式化：

df['label'].apply(lambda x: '%.17f' % x).values.tolist()

Answer 8

Expanding on this useful comment, here is a solution setting the formatting options only to display the results without changing options permanently:

with pd.option_context('display.float_format', lambda x: f'{x:,.3f}'):
    display(sum_sales_dept)

dept
value1  119,243,300.0
value2  129,306,600.0
value3  107,714,200.0

Format / Suppress Scientific Notation from Pandas Aggregation Results

Question

8 answers

solution1
329 ACCPTED 2014-01-15 14:40:32

solution2
148 2017-10-10 17:12:33

solution3
38 2018-01-23 14:00:05

solution4
24 2019-06-18 11:08:27

solution5
13 2020-11-19 22:00:57

solution6
4 2020-09-07 17:24:47

solution7
0 2017-12-04 17:18:34

solution8
0 2022-07-11 09:03:48

Format / Suppress Scientific Notation from Pandas Aggregation Results

Question

8 answers

solution1 329 ACCPTED 2014-01-15 14:40:32

solution2 148 2017-10-10 17:12:33

solution3 38 2018-01-23 14:00:05

solution4 24 2019-06-18 11:08:27

solution5 13 2020-11-19 22:00:57

solution6 4 2020-09-07 17:24:47

solution7 0 2017-12-04 17:18:34

solution8 0 2022-07-11 09:03:48

solution1
329 ACCPTED 2014-01-15 14:40:32

solution2
148 2017-10-10 17:12:33

solution3
38 2018-01-23 14:00:05

solution4
24 2019-06-18 11:08:27

solution5
13 2020-11-19 22:00:57

solution6
4 2020-09-07 17:24:47

solution7
0 2017-12-04 17:18:34

solution8
0 2022-07-11 09:03:48