UTF-8 replacement issues

Are the search regex strings  and string variables configured for UTF-8? I could see cases where a right Markdown file is not picked up  by the forward linker:
```bash
-rw-r--r--@ 1 ksandvik  CORP\Domain Users   56 Mar 21 14:56 KSDict/སྟོང་པ.md

(master) ~> python3 obs-linkr.py /Volumes/Work/Tibworkspace/ -r
Empty alias (will be ignored): aliases
----------------------
linked རྣམ་པར་ཤེས་པ
linked སྟོང
linked རྣམས
----------------------
linked text copied to clipboard

(master) ~> python3 --version
Python 3.9.2
```
Clipboard contents:
```bash
རྣམ་པར་ཤེས་པ་རྣམས་སྟོང་པའོ།།
```
Note that the word is part of the grammatical construct སྟོང་པའོ , འོ means ending means genitive. 
Note that the regex didn't pick up སྟོང་པ.md, rather སྟོང . This is in Tibetan but it's standard UTF-8. Worst case it's just a python3 regex UTF-8 bug not properly handling rune combinations.




Provide feedback

Saved searches

Use saved searches to filter your results more quickly

UTF-8 replacement issues #16

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

UTF-8 replacement issues #16

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions