數值回歸測試

Question

我正在研究一種科學計算代碼（用C ++編寫），除了對較小的組件進行單元測試之外，我還想通過比較一個“已知的好”來對一些數字輸出進行回歸測試。從以前的修訂回答。 我想要一些功能：

允許將數字與指定的容差進行比較（對於舍入誤差和更寬松的期望）
能夠區分整數，雙精度等，並在必要時忽略文本
格式良好的輸出，以告訴出錯的地方和位置：在多列數據表中，只顯示不同的列條目
返回EXIT_SUCCESS或EXIT_FAILURE具體取決於文件是否匹配

是否有任何好的腳本或應用程序可以執行此操作，或者我是否必須在Python中自行編寫以讀取和比較輸出文件？ 當然，我不是第一個有這種要求的人。

Answer 1

您應該選擇PyUnit ，它現在是名稱unittest的標准庫的一部分。 它支持您要求的一切。 例如，使用assertAlmostEqual()完成容差檢查。

Answer 2

ndiff實用程序可能接近你正在尋找的東西：它就像diff一樣，但它會將數字的文本文件與所需的容差進行比較。

Answer 3

我最終寫了一個Python腳本來做我想要的更多或更少。

#!/usr/bin/env python

import sys
import re
from optparse import OptionParser
from math import fabs

splitPattern = re.compile(r',|\s+|;')

class FailObject(object):
    def __init__(self, options):
        self.options = options
        self.failure = False

    def fail(self, brief, full = ""):
        print ">>>> ", brief
        if options.verbose and full != "":
            print "     ", full
        self.failure = True


    def exit(self):
        if (self.failure):
            print "FAILURE"
            sys.exit(1)
        else:
            print "SUCCESS"
            sys.exit(0)

def numSplit(line):
    list = splitPattern.split(line)
    if list[-1] == "":
        del list[-1]

    numList = [float(a) for a in list]
    return numList

def softEquiv(ref, target, tolerance):
    if (fabs(target - ref) <= fabs(ref) * tolerance):
        return True

    #if the reference number is zero, allow tolerance
    if (ref == 0.0):
        return (fabs(target) <= tolerance)

    #if reference is non-zero and it failed the first test
    return False

def compareStrings(f, options, expLine, actLine, lineNum):
    ### check that they're a bunch of numbers
    try:
        exp = numSplit(expLine)
        act = numSplit(actLine)
    except ValueError, e:
#        print "It looks like line %d is made of strings (exp=%s, act=%s)." \
#                % (lineNum, expLine, actLine)
        if (expLine != actLine and options.checkText):
            f.fail( "Text did not match in line %d" % lineNum )
        return

    ### check the ranges
    if len(exp) != len(act):
        f.fail( "Wrong number of columns in line %d" % lineNum )
        return

    ### soft equiv on each value
    for col in range(0, len(exp)):
        expVal = exp[col]
        actVal = act[col]
        if not softEquiv(expVal, actVal, options.tol):
            f.fail( "Non-equivalence in line %d, column %d" 
                    % (lineNum, col) )
    return

def run(expectedFileName, actualFileName, options):
    # message reporter
    f = FailObject(options)

    expected  = open(expectedFileName)
    actual    = open(actualFileName)
    lineNum   = 0

    while True:
        lineNum += 1
        expLine = expected.readline().rstrip()
        actLine = actual.readline().rstrip()

        ## check that the files haven't ended,
        #  or that they ended at the same time
        if expLine == "":
            if actLine != "":
                f.fail("Tested file ended too late.")
            break
        if actLine == "":
            f.fail("Tested file ended too early.")
            break

        compareStrings(f, options, expLine, actLine, lineNum)

        #print "%3d: %s|%s" % (lineNum, expLine[0:10], actLine[0:10])

    f.exit()

################################################################################
if __name__ == '__main__':
    parser = OptionParser(usage = "%prog [options] ExpectedFile NewFile")
    parser.add_option("-q", "--quiet",
                      action="store_false", dest="verbose", default=True,
                      help="Don't print status messages to stdout")

    parser.add_option("--check-text",
                      action="store_true", dest="checkText", default=False,
                      help="Verify that lines of text match exactly")

    parser.add_option("-t", "--tolerance",
                      action="store", type="float", dest="tol", default=1.e-15,
                      help="Relative error when comparing doubles")

    (options, args) = parser.parse_args()

    if len(args) != 2:
        print "Usage: numdiff.py EXPECTED ACTUAL"
        sys.exit(1)

    run(args[0], args[1], options)

Answer 4

我知道我參加派對已經很晚了，但幾個月前我寫了nrtest實用程序，試圖讓這個工作流更容易。 聽起來它也可能對你有所幫助。

這是一個快速概述。 每個測試都由其輸入文件及其預期的輸出文件定義。 執行后，輸出文件存儲在便攜式基准測試目錄中。 然后，第二步將此基准與參考基准進行比較。 最近的更新啟用了用戶擴展，因此您可以為自定義數據定義比較函數。

我希望它有所幫助。

數值回歸測試

問題描述

4 個解決方案

解決方案1
3 2009-06-28 18:51:39

解決方案2
0 2009-07-01 01:33:15

解決方案3
0 已采納 2009-07-15 18:03:49

解決方案4
0 2016-03-12 06:26:23

數值回歸測試

問題描述

4 個解決方案

解決方案1 3 2009-06-28 18:51:39

解決方案2 0 2009-07-01 01:33:15

解決方案3 0 已采納 2009-07-15 18:03:49

解決方案4 0 2016-03-12 06:26:23

解決方案1
3 2009-06-28 18:51:39

解決方案2
0 2009-07-01 01:33:15

解決方案3
0 已采納 2009-07-15 18:03:49

解決方案4
0 2016-03-12 06:26:23