mlprimitives.custom.text.TextCleaner fails if text is empty #228

csala · 2020-01-16T16:13:21Z

When the collection of texts to clean contains an empty string "", the mlprimitives.custom.text.TextCleaner._remove_stopwords crashes.

In [1]: from mlprimitives.custom.text import TextCleaner                                                                                                                                                                                                                       

In [2]: cleaner = TextCleaner()                                                                                                                                                                                                                                                

In [3]: cleaner.produce(['not empty', ''])                                                                                                                                                                                                                                     
---------------------------------------------------------------------------
LangDetectException                       Traceback (most recent call last)
<ipython-input-3-342ec016e729> in <module>
----> 1 cleaner.produce(['not empty', ''])
...
~/.virtualenvs/MLPrimitives/lib/python3.6/site-packages/langdetect/detector.py in _detect_block(self)
    148         ngrams = self._extract_ngrams()
    149         if not ngrams:
--> 150             raise LangDetectException(ErrorCode.CantDetectError, 'No features in text.')
    151 
    152         self.langprob = [0.0] * len(self.langlist)

LangDetectException: No features in text.

The text was updated successfully, but these errors were encountered:

csala self-assigned this Jan 16, 2020

csala added the bug There is an error in the code that needs to be fixed label Jan 16, 2020

csala added this to the 0.2.4 milestone Jan 16, 2020

csala mentioned this issue Jan 16, 2020

Fix crash on TextCleaner. Add tests #229

Merged

csala closed this as completed in #229 Jan 16, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

mlprimitives.custom.text.TextCleaner fails if text is empty #228

mlprimitives.custom.text.TextCleaner fails if text is empty #228

csala commented Jan 16, 2020

mlprimitives.custom.text.TextCleaner fails if text is empty #228

mlprimitives.custom.text.TextCleaner fails if text is empty #228

Comments

csala commented Jan 16, 2020