I need to create a function that will accept a sting input of any length and return an array of strings each containing n long chunks. For example, an input of This is a test
with 3 character long chunks should return:
Thi
s i
s a
tes
t
I have created the following function to do so. My question is, is there possibly a better, faster way to approach this? I know that this function may be called many times using very long strings and I do not wan't this to possibly slow down the system. Additionally, I eventuallly need to set the function up so that it also creates a new entry upon detection of a delimiter. For example, with a "chunk length" of three:
Testing with comma delimiters, one, two, three, test
Should return:
Tes
tin
g w
ith
co
mma
del
imi
ter
s,
one
,
two
,
thr
ee,
te
st
Notice that i do not want the delimiter it's self to be deleted or replaced. I just have a new line/new array entry populate just after detection.
Here is my code so far:
CREATE OR REPLACE FUNCTION SPLIT_STRING (
p_str VARCHAR2, --String to split
p_del VARCHAR2, --Delimiter
p_len INTEGER, --Length of each chunk
p_force NUMBER) --Forces split when length is reached (1=on, 0=off)
RETURN VARCHAR2 IS
l_tmp_str VARCHAR2(32767);
l_chnk_len INTEGER;
l_str VARCHAR2(32767);
l_chunk VARCHAR2(32767);
l_pos INTEGER;
l_len INTEGER;
l_chnksize NUMBER;
BEGIN
--Determine the strings total length
l_len:= LENGTH(p_str);
IF (l_len > 0)
THEN
l_tmp_str:= p_str;
--Determine the necessary number of chuncks
l_chnksize:=(l_len/p_len);
IF MOD(l_chnksize,1) != 0
THEN
l_chnksize:= CEIL(l_chnksize);
END IF;
--Split the string into chunks
IF p_force = 1
THEN
l_pos:=1;
FOR loop_num IN 1..l_chnksize
LOOP
IF (loop_num>1)
THEN
l_str:=l_str||CHR(10)||CHR(13)||SUBSTR(p_str,l_pos,p_len);
ELSE
l_str:=SUBSTR(p_str,l_pos,p_len);
END IF;
--Increment position placeholder
l_pos:=l_pos+p_len;
END LOOP;
ELSE
l_str:='UNFORCED, NOT IMPLEMENTED';
END IF;
END IF;
--Return the delimited string
RETURN l_str;
My specific question is: Is there a FASTER way to do this for LARGE string inputs?
I don't know if this is faster, but definitely simpler. You are not actually putting the chunks in arrays, but inserting newline character after every delimiter or a group of characters. This can be easily done using regular expressions.
select regexp_replace('Testing with comhm,a sdfdeli,mitjers,one,two,three,test',
'(.{0,3},)|(.{5})',
'\1\2' ||chr(10)) chunks
from dual;
CHUNKS
-------
Testi
ng wi
th co
mhm,
a sdf
deli,
mitje
rs,
one,
two,
three
,
test
Regex Explanation:
These first and second capture groups are replaced by themselves appended with a newline character.
Generic expression would be,
'(.{0,'||(length-2)||'}'||delimiter||')|(.{'||(length)||'})'
The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.