Our general strategy for tagging usability is to set up a series of tags for:
Overall task performance: direct/indirect success or failure
And then other general comments or themes like ideas, accessibility, navigation, bug, etc.
Then we just use the filters to pull insights. Like, what were all the failures on X task from?
Best of luck!