Parse a long string with different delimiter characters in VBA

I'm banging my head against this. I need to parse a long string like this one:

2003|Jaguar|S-Type|Base Sedan 4-Door|4.2L 4196CC V8 GAS DOHC Naturally Aspirated::Base To VIN # N52047 2003|Jaguar|S-Type|Base Sedan 4-Door|3.0L 183Cu. In. V6 GAS DOHC Naturally Aspirated::Base To VIN # N52047 2001|Jaguar|S-Type|Base Sedan 4-Door|4.0L 3996CC 244Cu. In. V8 GAS DOHC Naturally Aspirated::To VIN # N52047 2001|Jaguar|S-Type|Base Sedan 4-Door|3.0L 183Cu. In. V6 GAS DOHC Naturally Aspirated::To VIN # N52047 2002|Ford|Thunderbird 2002|Lincoln|LS 2002|Jaguar|S-Type|Base Sedan 4-Door|4.0L 3996CC 244Cu. In. V8 GAS DOHC Naturally Aspirated::To VIN # N52047 2000|Jaguar|S-Type|Base Sedan 4-Door|4.0L 3996CC 244Cu. In. V8 GAS DOHC Naturally Aspirated::To VIN # N52047 2002|Jaguar|S-Type|Base Sedan 4-Door|3.0L 183Cu. In. V6 GAS DOHC Naturally Aspirated::To VIN # N52047 2000|Jaguar|S-Type|Base Sedan 4-Door|3.0L 183Cu. In. V6 GAS DOHC Naturally Aspirated::To VIN # N52047 2000|Lincoln|LS 2003|Lincoln|LS 2001|Lincoln|LS 2003|Ford|Thunderbird 2004|Lincoln|LS 2004|Jaguar|S-Type|Base Sedan 4-Door|4.2L 4196CC V8 GAS DOHC Naturally Aspirated::Base To VIN # N52047 2004|Ford|Thunderbird 2005|Jaguar|S-Type|Sport Sedan 4-Door|3.0L 183Cu. In. V6 GAS DOHC Naturally Aspirated::Base / Sport To VIN # N52047 2005|Jaguar|S-Type|Base Sedan 4-Door|3.0L 183Cu. In. V6 GAS DOHC Naturally Aspirated::Base / Sport To VIN # N52047 2005|Lincoln|LS 2004|Jaguar|XJ8 2005|Jaguar|S-Type|Sport Sedan 4-Door|4.2L 4196CC V8 GAS DOHC Naturally Aspirated::Base / Sport To VIN # N52047 2006|Jaguar|S-Type|Base Sedan 4-Door|3.0L 183Cu. In. V6 GAS DOHC Naturally Aspirated::Base / VDP Edition To VIN # N52047 2006|Jaguar|S-Type|VDP Edition Sedan 4-Door|4.2L 4196CC V8 GAS DOHC Naturally Aspirated::Base / VDP Edition To VIN # N52047 2005|Jaguar|XJ8 2004|Jaguar|S-Type|Base Sedan 4-Door|3.0L 183Cu. In. 
V6 GAS DOHC Naturally Aspirated::Base To VIN # N52047 2006|Jaguar|S-Type|Base Sedan 4-Door|4.2L 4196CC V8 GAS DOHC Naturally Aspirated::Base / VDP Edition To VIN # N52047 2005|Ford|Thunderbird 2006|Lincoln|LS 2000|Jaguar|S-Type|Sport Sedan 4-Door|4.0L 3996CC 244Cu. In. V8 GAS DOHC Naturally Aspirated::To VIN # N52047 2002|Jaguar|S-Type|Sport Sedan 4-Door|4.0L 3996CC 244Cu. In. V8 GAS DOHC Naturally Aspirated::To VIN # N52047 2001|Jaguar|S-Type|Sport Sedan 4-Door|4.0L 3996CC 244Cu. In. V8 GAS DOHC Naturally Aspirated::To VIN # N52047 2002|Jaguar|S-Type|Base Sedan 4-Door|3.0L 2967CC 181Cu. In. V6 GAS DOHC Naturally Aspirated::To VIN # N52047 2005|Jaguar|S-Type|Sport Sedan 4-Door|3.0L 2967CC 181Cu. In. V6 GAS DOHC Naturally Aspirated::Base / Sport To VIN # N52047 2005|Jaguar|S-Type|Base Sedan 4-Door|4.2L 4196CC 256Cu. In. V8 GAS DOHC Naturally Aspirated::Base / Sport To VIN # N52047 2004|Jaguar|S-Type|Base Sedan 4-Door|3.0L 2967CC 181Cu. In. V6 GAS DOHC Naturally Aspirated::Base To VIN # N52047 2003|Jaguar|S-Type|Base Sedan 4-Door|4.2L 4196CC 256Cu. In. V8 GAS DOHC Naturally Aspirated::Base To VIN # N52047 2006|Jaguar|S-Type|Base Sedan 4-Door|3.0L 2967CC 181Cu. In. V6 GAS DOHC Naturally Aspirated::Base / VDP Edition To VIN # N52047 2004|Jaguar|S-Type|Base Sedan 4-Door|4.2L 4196CC 256Cu. In. V8 GAS DOHC Naturally Aspirated::Base To VIN # N52047 2005|Jaguar|S-Type|Sport Sedan 4-Door|4.2L 4196CC 256Cu. In. V8 GAS DOHC Naturally Aspirated::Base / Sport To VIN # N52047 2005|Jaguar|S-Type|Base Sedan 4-Door|3.0L 2967CC 181Cu. In. V6 GAS DOHC Naturally Aspirated::Base / Sport To VIN # N52047 2001|Jaguar|S-Type|Base Sedan 4-Door|3.0L 2967CC 181Cu. In. V6 GAS DOHC Naturally Aspirated::To VIN # N52047 2003|Jaguar|S-Type|Base Sedan 4-Door|3.0L 2967CC 181Cu. In. V6 GAS DOHC Naturally Aspirated::Base To VIN # N52047 2006|Jaguar|S-Type|Base Sedan 4-Door|4.2L 4196CC 256Cu. In. V8 GAS DOHC Naturally Aspirated::Base / VDP Edition To VIN # N52047 

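[image: better structure]

I know that my final table has 6 columns: 3 (Year, Make, Model) are mandatory and 3 (Trim, Engine, Notes) are optional.

The Engine value is merged with Notes and separated from it by the characters "::"; the other values are separated by the character "|".

[image: final table]

Here is part of my code; it does not work correctly. Any suggestions and improvements are welcome and appreciated :)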

Dim Ret
Dim Ret2
Dim strColumnA As String

strColumnA = wsTestComp.Range("A1")
Ret = Split(strColumnA, "|")
j = 1
k = 1
For i = LBound(Ret) To UBound(Ret)

    Debug.Print Ret(i)
    If IsNumeric(Ret(i)) Then
        wsTestComp.Range("A2").Offset(k, j).value = Ret(i)
        j = j + 1
    Else
        If IsNumeric(Right(Ret(i), 4)) Then
        Ret2 = Split(Ret(i), "::")
        For h = LBound(Ret2) To UBound(Ret2)
            If IsNumeric(Right(Ret(i), 4)) Then
            wsTestComp.Range("A2").Offset(k, j).value = Left(Ret2(h), Len(Ret2(h)) - 5)
            Else
            wsTestComp.Range("A2").Offset(k, j).value = Ret2(h)
            j = j + 1
            End If
        Next h

        k = k + 1
        Else
        wsTestComp.Range("A2").Offset(k, j).value = Ret(i)
        j = j + 1
        End If
        End If

Next i

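Use VBScript.RegExp to locate the vehicle years, and replace the space in front of each one with a character that can be uniquely distinguished from the rest of the clutter, so that the Split function can be used. The double colons can be handled with a simple Replace call.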

Sub makeCars()
    Dim tmp As String, y As Long, bUSE_REGEX As Boolean
    Dim pattern As String, replacement As String
    Dim rgx As Object, cmat As Object
    Dim v1 As Variant, v2 As Variant

    bUSE_REGEX = True

    With Worksheets("Sheet1")
        tmp = .Range("A1").Value2
        'turn "::" into "|", then turn every "|" into Chr(167) (the section sign) as the field separator
        tmp = Replace(tmp, Chr(58) & Chr(58), Chr(124))
        tmp = Replace(tmp, Chr(124), Chr(167))
    End With

    If bUSE_REGEX Then
        'REGEX method
        Set rgx = CreateObject("VBScript.RegExp")
        With rgx
            .Global = True
            .pattern = "\s[0-9]{4}\§"
            Set cmat = .Execute(tmp)
            For y = 0 To cmat.Count - 1
                replacement = Replace(cmat(y), Chr(32), Chr(182))
                tmp = Replace(tmp, cmat(y), replacement)
            Next y
        End With
    Else
        'non-REGEX method
        For y = 1950 To 2025
            tmp = Replace(tmp, Chr(32) & y & Chr(167), Chr(182) & y & Chr(167))
        Next y
    End If

    With Worksheets("Sheet1")
        v1 = Split(tmp, Chr(182))
        For y = LBound(v1) To UBound(v1)
            v2 = Split(v1(y), Chr(167))
            .Cells(y + 2, 1).Resize(1, UBound(v2) + 1) = v2
        Next y
    End With

End Sub

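I have also provided an alternative to the RegEx solution that simply cycles through the possible vehicle years (1950 to 2025 in the code). It is a bit of a "brute force" approach, but it gets the job done, and the difference between the two methods would be hard to measure even in milliseconds. It is viable here because the range of possible years is reasonably limited; RegEx should handle a wider range of possibilities.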
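As a variation on the RegEx branch, the record boundaries can probably be marked in a single call to the RegExp object's Replace method instead of looping over the matches. The sketch below is not part of the original answer: the function name MarkYearBoundaries is hypothetical, and it assumes the text has already had "::" and "|" converted to Chr(167), as makeCars does.

```vba
Option Explicit

' Hypothetical helper (an assumption, not from the original answer).
' Replaces the whitespace in front of every "<4 digits><Chr(167)>" with
' Chr(182), so that Split(result, Chr(182)) yields one record per element.
Function MarkYearBoundaries(ByVal txt As String) As String
    Dim rgx As Object

    Set rgx = CreateObject("VBScript.RegExp")
    With rgx
        .Global = True
        ' Whitespace, then a captured four-digit year plus the field separator.
        .Pattern = "\s(\d{4}" & Chr(167) & ")"
        ' $1 puts the captured year and separator back after the record marker.
        MarkYearBoundaries = .Replace(txt, Chr(182) & "$1")
    End With
End Function

Sub DemoMarkYearBoundaries()
    Dim s As String

    s = "2002" & Chr(167) & "Ford" & Chr(167) & "Thunderbird " & _
        "2002" & Chr(167) & "Lincoln" & Chr(167) & "LS"
    ' The space before the second year becomes Chr(182); the rest is unchanged.
    Debug.Print MarkYearBoundaries(s)
End Sub
```

Because the capture group carries the year through the replacement, each distinct year is handled in one pass and no Match collection needs to be walked.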

[image: regex_car_models]

The key is recognizing the year.

Here is a "bare" version of the code:

Option Explicit

Sub parsestring()

Dim Ret As Variant
Dim i As Long
Dim rng As Range

Set rng = ThisWorkbook.Worksheets("parse").Cells(1, 1) '<== cell with the string to parse

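' put a space in front of every "|", turn "::" into " |", then split into space-delimited tokens
Ret = Split(Replace(Replace(rng.Value, "|", " |"), "::", " |"), " ")
For i = LBound(Ret) To UBound(Ret)
    ' a token made of exactly four digits is taken to be a year, i.e. the start of a new record
    If Ret(i) Like "####" Then Ret(i) = "§§" & Ret(i)
Next i
' rebuild the string and split it into one element per record
' (the first element is empty because the string starts with a year)
Ret = Split(Join(Ret), "§§")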

With rng.Offset(2, 2) '<== the "database" will be placed two rows and columns away from the cell with the string to parse
    .Resize(UBound(Ret) + 1) = WorksheetFunction.Transpose(Ret)
    .Resize(UBound(Ret) + 1).TextToColumns Destination:=.Cells(1, 1), DataType:=xlDelimited, Other:=True, OtherChar:="|"
    .CurrentRegion.EntireColumn.AutoFit
End With

End Sub

And here is a version with some formatting and data sorting:

Sub parsestring2()

Dim Ret As Variant
Dim i As Long
Dim rng As Range

Set rng = ThisWorkbook.Worksheets("parse").Cells(1, 1) '<== cell with the string to parse


Ret = Split(Replace(Replace(rng.Value, "|", " |"), "::", " |"), " ")
For i = LBound(Ret) To UBound(Ret)
    If Ret(i) Like "####" Then Ret(i) = "§§" & Ret(i)
Next i
Ret = Split(Join(Ret), "§§")

With rng.Offset(2, 2) '<== the "database" will be placed two rows and columns away from the cell with the string to parse
    .Resize(UBound(Ret) + 1) = WorksheetFunction.Transpose(Ret)
    .Resize(UBound(Ret) + 1).TextToColumns Destination:=.Cells(1, 1), DataType:=xlDelimited, Other:=True, OtherChar:="|"
    With .Resize(1, 6)
        .Value = Array("Year", "Make", "Model", "Trim", "Engine", "Notes")
        .Interior.ColorIndex = 16
        .Font.ColorIndex = 2
    End With
    .CurrentRegion.Sort key1:="Year", order1:=xlDescending, key2:="Make", order2:=xlAscending, key3:="Model", order3:=xlAscending, header:=xlYes
    .CurrentRegion.EntireColumn.AutoFit
End With

End Sub
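
The question says that Trim, Engine and Notes are optional, so a record may carry only three fields. As a minimal sketch of handling that explicitly, the hypothetical helper below (SplitCarRecord is an illustrative name, not from either answer) splits one already-isolated record into a fixed six-element array and leaves missing optional fields empty. It assumes fields always appear in the order Year, Make, Model, Trim, Engine, Notes.

```vba
Option Explicit

' Hypothetical helper (an assumption, not from the original answers).
' Splits one record such as
'   "2003|Jaguar|S-Type|Base Sedan 4-Door|4.2L 4196CC V8 ...::Base To VIN # N52047"
' into six strings: Year, Make, Model, Trim, Engine, Notes.
Function SplitCarRecord(ByVal record As String) As Variant
    Dim fields(0 To 5) As String
    Dim parts() As String
    Dim i As Long

    ' Treat "::" as just another field separator, then split on "|".
    parts = Split(Replace(Trim$(record), "::", "|"), "|")

    ' Copy the fields that are present; missing ones remain empty strings.
    For i = LBound(parts) To UBound(parts)
        If i > 5 Then Exit For
        fields(i) = Trim$(parts(i))
    Next i

    SplitCarRecord = fields
End Function

Sub DemoSplitCarRecord()
    Dim f As Variant

    f = SplitCarRecord("2002|Ford|Thunderbird")
    ' Year, Make and Model are filled; Trim, Engine and Notes are empty.
    Debug.Print f(0), f(1), f(2), "[" & f(3) & "]"
End Sub
```

Returning a fixed-size array means each record can be written to the sheet with a single .Resize(1, 6) assignment, whether or not the optional fields are present.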
