Two locking fixes:

 - Invoke the lockdep tracepoints in the correct place so the ordering
   is correct again.

 - Don't leave the mutex WAITER bit stale when the last waiter is dropping
   out early due to a signal as that forces all subsequent lock operations
   needlessly into the slowpath until it's cleaned up again.