From quick testing, NeMo's logical reasoning performance is very poor compared to even something like phi-3-mini.