I come from a Linux shop and am now supporting a fleet of Windows workstations and servers along with development tasks.
I have found that there is a shocking amount of instability with Windows servers and workstations. I can't believe business can operate like this. I'm currently trying to rebuild a critical PC as it freezes up all the time. Debugging the issue is a nightmare.
Our servers regularly stop working, some interaction between active directory and DNS. I haven't been able to get to the bottom of it. Every second Sunday the server requires a reboot else the entire network doesn't seem to work.
This didn't happen with Linux things just worked or you knew why they didn't.
Is my experience typical or just bad luck?
Having said that the environment you describe seems to be exceptionally flaky. I suspect that there are probably some serious misconfiguration issues, possibly some poor quality or resource constrained hardware, bad power (e.g. working in an industrial environment without UPS, power conditioners). Incorrect Windows licensing for servers can sometimes be an issue. I have seen environments that were incorrectly installed and configured causing significant issue.
Why did the person before you leave?
Define "instability"
Mostly-unmanaged, end-user workstations? Try better eucation for your users and/or better device management
But servers?! Sorry. But if you have Windows-based software running on anything newer then 2008R2 or 2012, it's not Windows' fault: it's the software you're running.
I've been working in and around massive enterprise deployments of multiple OSes for over a decade and a half. Pretty much every server Windows release newer than 2003 (and 2003 was pretty dang stable) is fine.
The software running on it can be flaky as all get out - but the OS is fine
Plan your maintenance windows. Apply patches in a timely manner. And you'll be fine.
At this level, it's all about sales team relationships to key decision makers rather than quality.
This is why larger b2b company all-hands meetings are always 80% about the sales team's exploits - they really do make or break the company.
I had this issue recently and it caused a lot of head scratching....
That's no longer my experience. Modern Windows is quite stable. Modern Linux gives me more issues, systemd related sometimes, and every time I update and get a new kernel, I have to struggle to get wifi back.
I will admit that my Windows 10 sometimes reboots itself overnight for updates, but other than that I often have quite long uptimes on it.
This shouldn't be a statement any IT staffer can utter
There should never be such a thing as "a critical PC" - single points of failure are an IT 101 no-no
"freezes up all the time" indicates an application issue 99% of the time, not an OS issue
"Debugging the issue is a nightmare": what are you doing to troubleshoot the issue? How are you isolating the problem(s)? Odds are exceptionally good that it's not a root cause analysis, but a root causes analysis. What logs are you checking? What is going on when it dies?