Difference between revisions of "Separate Words"

From Second Life Wiki
Jump to navigation Jump to search
(factor out keepSpacers and discardSeparators, add caveats and preconditions, strike the dangling See Also link)
m (<lsl> tag to <source>)
 
(5 intermediate revisions by 3 users not shown)
Line 6: Line 6:
== Function: [[list]] separateWords([[string]] {{LSL Param|src}},[[list]] {{LSL Param|separators}},[[list]] {{LSL Param|spacers}}); ==
== Function: [[list]] separateWords([[string]] {{LSL Param|src}},[[list]] {{LSL Param|separators}},[[list]] {{LSL Param|spacers}}); ==
<div style="padding: 0.5em;">
<div style="padding: 0.5em;">
Returns the words of the '''{{LSL Param|src}}''' string by keeping the '''{{LSL Param|spacers}}''', discarding the '''{{LSL Param|separators}}''', and also getting the words in between.
Returns the words of the {{LSLP|src}} string by keeping the {{LSLP|spacers}}, discarding the {{LSLP|separators}}, and also getting the words in between.


Parameters:
Parameters:
Line 26: Line 26:
Caveats:
Caveats:


Same as llParseString2List, this function returns a list of strings, not a mixed list of strings and floats and such. Cast an entry of the list to the type you need, ''e.g.'', { string words = separateWords(...); integer value = (integer) llList2String(words, index); }. Remember that LSL cast to integer works better than llList2Integer, ''e.g.'', cast to integer understands hexadecimal integer literals such as 0x2A.
Same as llParseString2List, this function returns a list of strings, not a mixed list of strings and floats and such. Cast an entry of the list to the type you need, ''e.g.'', { string words = separateWords(...); integer value = (integer) llList2String(words, index); }. Remember that LSL cast to integer works better than llList2Integer, ''e.g.'', cast to integer understands hexadecimal integer literals such as "0x2A".
</div></div>
</div></div>


Line 32: Line 32:
== Implementation ==
== Implementation ==
<div style="padding: 0.5em;">
<div style="padding: 0.5em;">
<pre>
<source lang="lsl2">
// http://wiki.secondlife.com/wiki/Separate_Words
// http://wiki.secondlife.com/wiki/Separate_Words


Line 90: Line 90:
     return discardSeparators(keepSpacers([src], spacers), separators);
     return discardSeparators(keepSpacers([src], spacers), separators);
}
}
</pre>
</source>
</div></div>
</div></div>


Line 99: Line 99:
Asking to keep the spacers, discard the separators, and get the words between out of this '''{{LSL Param|src}}''':<br/>
Asking to keep the spacers, discard the separators, and get the words between out of this '''{{LSL Param|src}}''':<br/>
<br/>
<br/>
''42 0.99 \"00000000-0000-0000-0000-000000000000\" [abc, def] \"xyz\\\\\"zyx ijk\" <0, 1, 2, 3> // source literals''<br/>
42 0.99 "00000000-0000-0000-0000-000000000000" [abc, def] "xyz\\"zyx ijk" <0, 1, 2, 3> // source literals OK<br/>
<br/>
<br/>
'''says:'''<br/>
'''says:'''<br/>
Line 137: Line 137:
33: source<br/>
33: source<br/>
34: literals<br/>
34: literals<br/>
42 0.99 "00000000-0000-0000-0000-000000000000" [abc, def] "xyz\\"zyx ijk" <0, 1, 2, 3> // source literals
OK
OK


<pre>
<source lang="lsl2">
 
// Demo keeping the spacers, discarding the separators, and getting the words in between.
// Demo keeping the spacers, discarding the separators, and getting the words in between.
string src()
{
    return "42 0.99 \"00000000-0000-0000-0000-000000000000\"" +
        " [abc, def] \"xyz\\\\\"zyx ijk\" <0, 1, 2, 3> // source literals";
}


string LF = "\n";
string LF = "\n";
Line 168: Line 176:
{
{
     state_entry()
     state_entry()
     {      
     {
        string chars = "42 0.99 \"00000000-0000-0000-0000-000000000000\"
         list words = separateWords(src(), separators(), spacers);
            [abc, def] \"xyz\\\\\"zyx ijk\" <0, 1, 2, 3> // source literals";
         list words = separateWords(chars, separators(), spacers);
         ownerSayStrings(words);
         ownerSayStrings(words);
        llOwnerSay(src());
         llOwnerSay("OK");
         llOwnerSay("OK");
     }
     }
}
}
</pre>
</source>
</div></div>
</div></div>


Line 189: Line 196:
* [[llList2CSV]]
* [[llList2CSV]]
* [[LlParseStringKeepNulls]]
* [[LlParseStringKeepNulls]]
'''Script Library'''
* [[ParseString2List]]


'''Implementation Differences'''
'''Implementation Differences'''

Latest revision as of 16:58, 24 January 2015

Function: list separateWords(string src,list separators,list spacers);

Returns the words of the src string by keeping the spacers, discarding the separators, and also getting the words in between.

Parameters:

• string src source string
• list separators separators to be discarded
• list spacers spacers to be kept

This separateWords function works like llParseString2List but accepts many more spacers and separators. If you began by using llParseString2List and then your code grew to involve more than 8 spacers or separators, you might want to call separateWords, keepSpacers, and/or discardSeparators, in place of calling llParseString2List.

This separateWords function does not return the empty words that arguably exist between adjacent separators and spacers. The separateWords and llParseString2List functions do not return such words (i.e., do not return "any null values generated"). The llParseStringKeepNulls function does return such words.

Preconditions to avoid confusion:

  1. Provide lists of strings as the separators and spacers, not mixed lists of strings and floats and such.
  2. Don't let any spacer contain or equal a separator, and don't let any separator contain or equal a spacer.
  3. Do list each spacer and separator only once.

Caveats:

Same as llParseString2List, this function returns a list of strings, not a mixed list of strings and floats and such. Cast an entry of the list to the type you need, e.g., { string words = separateWords(...); integer value = (integer) llList2String(words, index); }. Remember that LSL cast to integer works better than llList2Integer, e.g., cast to integer understands hexadecimal integer literals such as "0x2A".

Implementation

// http://wiki.secondlife.com/wiki/Separate_Words

// Keep the spacers, discard the separators, and get the words in between, within
// astonishing limits described at http://wiki.secondlife.com/wiki/llParseString2List

list keepSpacersDiscardSeparators(list sources, list separators, list spacers)
{
    list words = [];

    integer index;
    integer sourcing = llGetListLength(sources);

    for (index = 0; index < sourcing; ++index)
    {
        string source = llList2String(sources, index);
        words += llParseString2List(source, separators, spacers);

    }
    return words;
}

// Keep the spacers and get the words in between.

list keepSpacers(list sources, list spacers)
{
    list words = sources;
    integer index;
    integer spacing = llGetListLength(spacers);
    for (index = 0; index < spacing; index += 8)
    {
        list someSpacers = llList2List(spacers, index, index + 8 - 1);
        words = keepSpacersDiscardSeparators(words, [], someSpacers);
    }
    return words;
}

// Discard the separators but get the words in between.

list discardSeparators(list sources, list separators)
{
    list words = sources;
    integer index;
    integer separating = llGetListLength(separators);
    for (index = 0; index < separating; index += 8)
    {
        list someSeparators = llList2List(separators, index, index + 8 - 1);
        words = keepSpacersDiscardSeparators(words, someSeparators, []);
    }
    return words;
}

// Keep the spacers and discard the separators and get the words in between.

list separateWords(string src, list separators, list spacers)
{
    return discardSeparators(keepSpacers([src], spacers), separators);
}

Demo

Asking to keep the spacers, discard the separators, and get the words between out of this src:

42 0.99 "00000000-0000-0000-0000-000000000000" [abc, def] "xyz\\"zyx ijk" <0, 1, 2, 3> // source literals OK

says:
0: 42
1: 0.99
2: "
3: 00000000
4: -
5: 0000
6: -
7: 0000
8: -
9: 0000
10: -
11: 000000000000
12: "
13: [
14: abc
15: def
16: ]
17: "
18: xyz
19: \
20: \
21: "
22: zyx
23: ijk
24: "
25: <
26: 0
27: 1
28: 2
29: 3
30: >
31: /
32: /
33: source
34: literals
42 0.99 "00000000-0000-0000-0000-000000000000" [abc, def] "xyz\\"zyx ijk" <0, 1, 2, 3> // source literals OK

// Demo keeping the spacers, discarding the separators, and getting the words in between.

string src()
{
    return "42 0.99 \"00000000-0000-0000-0000-000000000000\"" +
        " [abc, def] \"xyz\\\\\"zyx ijk\" <0, 1, 2, 3> // source literals";
}

string LF = "\n";
string DQUOTE = "\""; // double quote
string ESCAPE = "\\";

list spacers = [DQUOTE, "(", ")", "<", ">", "[", "]", "/", "+", "-", "*", "%", ESCAPE];

list separators()
{
    string TAB = llUnescapeURL("%09"); // != "\t"
    string CR = llUnescapeURL("%0D"); // != "\r"
    return [TAB, LF, CR, " ", ",", ";"];
}

ownerSayStrings(list strings)
{
    integer stringing = llGetListLength(strings);
    integer index;
    for (index = 0; index < stringing; ++index)
    {
        llOwnerSay((string) index + ": " + llList2String(strings, index));
    }        
}

default
{
    state_entry()
    {
        list words = separateWords(src(), separators(), spacers);
        ownerSayStrings(words);
        llOwnerSay(src());
        llOwnerSay("OK");
    }
}