When i use XSLT 2.0 key and tokenize function, it's return items order getting changed based on key value. in our output we required retain the same o ...
When i use XSLT 2.0 key and tokenize function, it's return items order getting changed based on key value. in our output we required retain the same o ...
I need to replace the if-else section with a function enclosing the same if-else. For ex: Following is the code with if-else condition Now I want ...
I'm tokenizing sentences with the following function using regular expressions. The problem is that it removes words less than length three before ...
I was using this site to get a better understanding of the spacy library to tokenize https://machinelearningknowledge.ai/complete-guide-to-spacy-token ...
So I need to tokenize a string by all spaces not between quotes, I am using regex in Javascript notation. For example: becomes For my use case ...
So I need to select all spaces not between quotes and delete them, I am using regex in Javascript notation. For example: becomes UPDATE: I am l ...
I use uax_url_email tokenizer for email fields in our index. It works perfect and generates single token for normal emails like johndoe@yahoo.com. How ...
I'm a beginner in python so I already apologize for the very basic question. I have an assignment where I have to create a list of sentences and toke ...
I have a dataset consisting of tokenized tuples. My steps of pre-processing were first, tokenizing the words, and then normalizing slang words. But th ...
I'm building a pipeline in Spark NLP (version 3.2.1) to create Tokens from a string column that contains searched words by words separated by comma. ...
I am a novice of spaCy and am using spaCy to process medical literature. I found that Tokenizer would divide the Latin name composed of two words into ...
I'm trying to find out which nouns exist in a sentence, i'm using pos_tag from nltk but it's not working very well here is my code/function for exa ...
I am trying to check if a data frame attribute value contains a particular string or not. This is the code snippet: 'str' object has no attribute ' ...
OpenAI's new embeddings API uses the cl100k_base tokenizer. I'm calling it from the NodeJS client but I see no easy way of slicing my strings so they ...
I would like to translate greek characters to their common latin equivalents for the purpose of full-text search. Consider the following: The long ...
So I have a problem to find sentances containing certain words from text and outputting those sentances with their indexes (I mean sentance number in ...
I am trying to make a Countvectorizer with a custom tokenizer function. I am facing a weird problem with it. In below code temp_tok is a list of 5 val ...
I am using proc macros to parse a given input into a Node tree, for debugging purposes I want to stringify and print the output to see if I am success ...
My question is, what is the correct terminology for the process of reading a simple config and creating data structures corresponding to the data we ...
So my question is two-fold, I'm trying to make a computer algebra system similar to SymPy or Math.NET symbolics my idea is to use some sort of macro ...