I apologize - I just now noticed your reply. Perhaps you’d be better served with the lucene.smartcn analyzer? You can use the Explain Output to see how the query was formed by analyzing the query text. That query is bi-grammed and the highlights you’re showing look reasonable for the query given those bi-grams.