
Is `built-in method numpy.core._multiarray_umath.implement_array_function` a performance bottleneck?

I'm using numpy v1.18.2 in some simulations, relying on built-in functions such as np.unique, np.diff and np.interp. I apply these functions to standard objects, i.e. lists or numpy arrays.

When I checked with cProfile, I saw that these functions make a call to the built-in method numpy.core._multiarray_umath.implement_array_function, and that this method accounts for 32.5% of my runtime. To my understanding, this is a wrapper that performs some checks to make sure that the arguments passed to the function are compatible with it.
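
For context, the kind of profiling run I mean looks roughly like the sketch below; the array sizes and loop count are made-up placeholders, not my actual simulation.

import cProfile
import pstats
import numpy as np

def simulate():
    # Placeholder workload: call the numpy functions many times on small arrays,
    # the way the simulation does.
    a = np.random.randint(0, 100, size=1000)
    x = np.linspace(0.0, 1.0, 100)
    for _ in range(10000):
        np.unique(a)
        np.diff(a)
        np.interp(0.5, x, x)

cProfile.run('simulate()', 'sim.prof')
# implement_array_function shows up high in the cumulative-time column.
pstats.Stats('sim.prof').sort_stats('cumulative').print_stats(10)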

I have two questions:

  1. Is this function ( implement_array_function ) actually taking up so much time, or is it the operations I'm doing ( np.unique , np.diff , np.interp ) that are taking up all this time? That is, am I misinterpreting the cProfile output? I was confused by the hierarchical output of snakeviz. Please see the snakeviz output here and details for the function here.
  2. Is there any way to disable it/bypass it, since the inputs need not be checked each time? The arguments I pass to these numpy functions are already controlled in my code, so I am hoping that this will give me a performance improvement.

I have already seen this question ( what is numpy.core._multiarray_umath.implement_array_function and why it costs lots of time? ), but I was not able to understand what exactly the function is or does. I also tried to understand NEP 18, but couldn't work out how exactly to solve the issue. Please fill in any gaps in my knowledge and correct any misunderstandings. I'd also appreciate it if someone could explain this to me like I'm five (r/explainlikeimfive) instead of assuming developer-level knowledge of Python.

All the information below is taken from NEP 18 .

Is this function ( implement_array_function ) actually taking up so much time, or is it the operations I'm doing ( np.unique , np.diff , np.interp ) that are taking up all this time?

As @hpaulj correctly mentioned in the comments, the overhead of the dispatcher adds 2-3 microseconds to each numpy function call. This will probably be shortened to 0.5-1 microseconds once it is implemented in C. See here.
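
If you want to sanity-check that number on your own machine, one quick micro-benchmark is to time a dispatched call against the underlying implementation. The sketch below assumes an internal detail of NumPy 1.17/1.18 (the dispatched function exposes the raw implementation through its __wrapped__ attribute), so treat it as a rough estimate, not an official API.

import numpy as np
from timeit import timeit

a = np.arange(1000)
n = 100000

# np.diff goes through the __array_function__ dispatcher; np.diff.__wrapped__
# is (in these versions) the raw implementation that skips the dispatcher.
t_dispatched = timeit(lambda: np.diff(a), number=n)
t_direct = timeit(lambda: np.diff.__wrapped__(a), number=n)

print('approximate dispatch overhead per call: '
      '{:.2f} microseconds'.format((t_dispatched - t_direct) / n * 1e6))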

Is there any way to disable it/bypass it

Yes, from NumPy 1.17 you can set the environment variable NUMPY_EXPERIMENTAL_ARRAY_FUNCTION to 0 (before importing numpy), and this will disable the use of implement_array_function (see here). Something like:

import os
os.environ['NUMPY_EXPERIMENTAL_ARRAY_FUNCTION'] = '0'
import numpy as np
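
One rough way to confirm the switch took effect, continuing the snippet above, is to check for the dispatcher wrapper. This again relies on the internal __wrapped__ detail mentioned earlier (I believe it holds for 1.17/1.18), so treat it as a sanity check only.

# With dispatch disabled, np.diff is the plain implementation itself,
# so the wrapper attribute should be absent.
print('NEP 18 dispatch active:', hasattr(np.diff, '__wrapped__'))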

However, disabling it probably will not give you any notable performance improvement, since its overhead is just a few microseconds, and the dispatch mechanism is the default in later numpy versions anyway.
