简体   繁体   English

从SQL Server中的数据字段中提取字符串

[英]Extracting a string from data field in sql server

I am trying to select the string between every pair <....> in the fifth column "QuestionTags". 我试图在第五列“ QuestionTags”中选择每对<....>之间的字符串。

here is a sample of the data: 这是数据示例:

在此处输入图片说明

i used CHARINDEX function but it returned me an integer. 我使用了CHARINDEX函数,但返回了一个整数。

i also used SUBSTRING but it asks me to define the character index and length of string. 我还使用了SUBSTRING,但它要求我定义字符索引和字符串长度。

Any suggestions? 有什么建议么?

If you are open to a Table-Valued-Function and not using 2016. 如果您愿意使用表值函数而不使用2016。

Tired of extracting strings (charindex,left,right,substring,...), I modified a Parse/Split function to accept two NON-LIKE delimiters. 厌倦了提取字符串(charindex,left,right,substring等),我修改了Parse / Split函数以接受两个NON-LIKE分隔符。 In your case a < and > 就您而言, <>

Example

Declare @YourTable table (ID int,QuestionTags varchar(max))
Insert Into @YourTable values
 (1,'<php><arrays><cloud><tag-cloud>')
,(2,'<windows><mailto>')

Select A.ID
      ,B.*
 From  @YourTable A
 Cross Apply [dbo].[udf-Str-Extract](A.QuestionTags,'<','>') B

Returns 退货

ID  RetSeq  RetPos  RetVal
1   1       2       php
1   2       7       arrays
1   3       15      cloud
1   4       22      tag-cloud
2   1       2       windows     --<< Second Record
2   2       11      mailto

The UDF if interested UDF(如果有兴趣)

CREATE FUNCTION [dbo].[udf-Str-Extract] (@String varchar(max),@Delimiter1 varchar(100),@Delimiter2 varchar(100))
Returns Table 
As
Return (  

with   cte1(N)   As (Select 1 From (Values(1),(1),(1),(1),(1),(1),(1),(1),(1),(1)) N(N)),
       cte2(N)   As (Select Top (IsNull(DataLength(@String),0)) Row_Number() over (Order By (Select NULL)) From (Select N=1 From cte1 N1,cte1 N2,cte1 N3,cte1 N4,cte1 N5,cte1 N6) A ),
       cte3(N)   As (Select 1 Union All Select t.N+DataLength(@Delimiter1) From cte2 t Where Substring(@String,t.N,DataLength(@Delimiter1)) = @Delimiter1),
       cte4(N,L) As (Select S.N,IsNull(NullIf(CharIndex(@Delimiter1,@String,s.N),0)-S.N,8000) From cte3 S)

Select RetSeq = Row_Number() over (Order By N)
      ,RetPos = N
      ,RetVal = left(RetVal,charindex(@Delimiter2,RetVal)-1) 
 From  (
        Select *,RetVal = Substring(@String, N, L) 
         From  cte4
       ) A
 Where charindex(@Delimiter2,RetVal)>1

)
/*
Max Length of String 1MM characters

Declare @String varchar(max) = 'Dear [[FirstName]] [[LastName]], ...'
Select * From [dbo].[udf-Str-Extract] (@String,'[[',']]')
*/

If you can use SQL Server 2016 for this then there is a built in string_split function that will do the job 如果您可以为此使用SQL Server 2016,则有一个内置的string_split函数可以完成此工作

SELECT *
FROM   YourTable
       OUTER APPLY (SELECT SUBSTRING(value, 2, 8000) value
                    FROM   string_split(QuestionTags, '>')
                    WHERE  value <> '') OA 

A demo in Stack Exchange Data Explorer as it looks like you are using SE data. 看起来像您正在使用SE数据的Stack Exchange Data Explorer中的演示。

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM