Changeset 159:e4e3160be9c0 for classifier/tokenizer/filters.py
- Timestamp:
- 08/19/07 10:34:17 (13 months ago)
- Files:
-
- 1 modified
-
classifier/tokenizer/filters.py (modified) (1 diff)
Legend:
- Unmodified
- Added
- Removed
-
classifier/tokenizer/filters.py
r157 r159 157 157 lang = options['lang'] 158 158 stopwords = self._getStopWords(lang) 159 if 'treshold' in options: 160 tres = options['treshold'] 161 else: 162 tres = self.treshold 163 159 164 return [word for word in text if (word not in stopwords 160 and len(word) > self.treshold)]165 and len(word) > tres)] 161 166 162 167 registerFilter(StopWords())
