I guess turning metadata detection off is one solution (along with the regex for a word character). But i'm sure you have it enabled for a reason.. Alternatively, you could use a regex to look for characters that indicate typical sentences/paragraphs format. For example;
#Word ending with a full stop, exclamation point or question mark, a space (or new line) then another word (beginning with a capital letter)
\s\w+[.?!]\s[A-Z]+\s
You'd assume this would match multiple times in a document, so you could make the threshold 2 or 3+ to make sure you don't get a red herring in the metadata. If you do and it's isolated to a particular property, just add an exception for that particular pattern or keyword aka (Exclude "Keywords:" or "\w+:\s")
Dean