How does laziness affect benchmarking in Haskell?

Question

This question is related to the following question : How to force evaluation in Haskell?

I want to benchmark the algorithm quicksort for a list. For this I have made a certain number of files which have random numbers in them.

Here is the relevant part of the code in question :

import System.IO
import Data.Time
import Control.DeepSeq

getListFromFiles :: IO [[Int]]
quicksort :: (Ord a) => [a] -> [a]

main = do
  l <- getListFromFiles
  start <- getCurrentTime
  let l' = map quicksort l
  end <- l' `deepseq` getCurrentTime
  print (diffUTCTime end start)

I do not want to know want to measure the time the program takes to look into the files, just the one that the sorting takes. Because of laziness, I think that the list l is only evaluated when deepseq is called on the list l' and that gives a flawed benchmark. Am I correct ?

Answer 1

I think that the list l is only evaluated when deepseq is called on the list l'...

Correct.

...and that gives a flawed benchmark.

Let me make an assumption about what you mean by "flawed". I guess what you mean is that getCurrentTime will return a time from before the sort is fully completed. Under that assumption, no, the benchmark is not flawed. I'm not sure I can explain which part of your reasoning is wrong, though, because you don't say why you think the benchmark will be flawed.

However, there is a pitfall to be aware of that I suspect is different from the one you had in mind: you should make sure that the input list is fully evaluated before calling the starting getCurrentTime , thus:

  start <- l `deepseq` getCurrentTime

This may or may not matter, depending on exactly how you implemented getListFromFiles .

How does laziness affect benchmarking in Haskell?

Question

1 answers

solution1
3 ACCPTED 2017-10-04 16:59:27

How does laziness affect benchmarking in Haskell?

Question

1 answers

solution1 3 ACCPTED 2017-10-04 16:59:27

solution1
3 ACCPTED 2017-10-04 16:59:27