this post was submitted on 21 Dec 2023

56 points (91.2% liked)

Linux

48689 readers

426 users here now

From Wikipedia, the free encyclopedia

Linux is a family of open source Unix-like operating systems based on the Linux kernel, an operating system kernel first released on September 17, 1991 by Linus Torvalds. Linux is typically packaged in a Linux distribution (or distro for short).

Distributions include the Linux kernel and supporting system software and libraries, many of which are provided by the GNU Project. Many Linux distributions use the word "Linux" in their name, but the Free Software Foundation uses the name GNU/Linux to emphasize the importance of GNU software, causing some controversy.

Rules

Posts must be relevant to operating systems running the Linux kernel. GNU/Linux or otherwise.
No misinformation
No NSFW content
No hate speech, bigotry, etc

Related Communities

Community icon by Alpár-Etele Méder, licensed under CC BY 3.0

founded 5 years ago

MODERATORS

[email protected]

Super weird error, what's happening? (lemmy.zip)

submitted 1 year ago by SuperSpruce to c/[email protected]

36 comments fedilink hide all child comments

I'm not sure if this is the best community to post in, but I just bought a used computer and slotted in an RX480 as the GPU. I installed KDE Neon 5.27 on it, and it worked flawlessly for 2 days.

Then, even though it was working earlier today, it slept and then would not wake up. So I turned off the power and turned it back on again, and was greeted with this error screen:

The only prior error message I'd gotten from the system was when I tried to install wine for one application, it told me some packages weren't up to date, without a way to fix it. I can enter the BIOS just fine.

What is going on? How do I fix this?

all 38 comments

sorted by: hot top controversial new old

[–] [email protected] 21 points 1 year ago* (last edited 1 year ago) (2 children)

None of what's visible helps identifying the error. Try journalctl -xb as suggested it might show more relevant information

Edit: oops should've been joirnalctl instead of journal

[–] SuperSpruce 9 points 1 year ago (2 children)

I tried to do that, and it couldn't find the journal package. So I tried to install it, but apt, flatpak, nor snap could find the package to install.

[–] [email protected] 25 points 1 year ago (2 children)

This was probably supposed to say "journalctl -xb"

[–] SuperSpruce 5 points 1 year ago (1 children)

Okay, that command works for me. The last line says that /etc/hosts:7: hostname "SuperSpruce_Iron_3900X" is not valid, ignoring.

[–] [email protected] 13 points 1 year ago

Not sure if this is the root cause of your boot failure, but underscores in hostnames are not allowed. A- Z, 0-9 and - are the only allowed characters.

[–] [email protected] 1 points 1 year ago

Oops 😬

[–] [email protected] 7 points 1 year ago

Read the error again. It's journalctl.

[+] [email protected] -27 points 1 year ago (2 children)

See. Stuff like that is why i started going non-Systemd.

[–] [email protected] 21 points 1 year ago* (last edited 1 year ago) (2 children)

You're welcome to use whatever init system you want, but Systemd solves a lot of the bullshit problems and limitations that come from init.d init scripts. Systemd also has a lot of its own bullshit and bloat, but it does an excellent job at actually being an init system and service manager if you know how to properly use it.

[–] [email protected] 7 points 1 year ago* (last edited 1 year ago)

solves a lot of the bullshit problems and limitations that come from init.d init scripts.

So do the other ~7 init systems developed since then. And, as far as i know, all of them print their relevant trouble directly to stderr. Who cares about SysV still?

Hey guys, why all the downvotes? Systemd is known for throwing all the irrelevant stuff at you, making it troublesome to debug. Which is why i switched. And i can confirm: Runit, S6, OpenRC and even simple Dinit are way better in that regard (and they do make less trouble generally).

[–] [email protected] -1 points 1 year ago

Almost everything you said is mere brochureware perpetuated by a tribe stronger than the vi mafia.

Sysvinit starts fast, starts well, and doesn't try to control mounts, cron, Getty, and everything else.

The"but it retries things" whine was a solved problem in 2001. So easy.

The EL6 machines I have in storage start faster than the el7 machines joining them. PCLinuxOS is a very valid non-systemd system that only lacks a documented kickstart emulant.

[–] [email protected] 2 points 1 year ago (1 children)

I doubt other init systems can make the screen bigger so that the vital part of the kernel log is still on screen, but be my guest to prove me wrong 🙄

[–] [email protected] 0 points 1 year ago (1 children)

https://wiki.archlinux.org/title/Kernel_mode_setting

It's a kernel thing.

[–] [email protected] 1 points 11 months ago

Thank you for proving my point.

[–] [email protected] 12 points 1 year ago

Shit's broke yo.

Sleep/wake issues with AMD gpu and platform drivers are super, super, super common. Fish back through your kernel journal after a reboot (journalctl -kb -1 should do it) and look for the driver errors immediately after the wake event. If this has been fixed in a later kernel release then update your kernel, if not go report it to either the Ubuntu folks or on the amdgpu gitlab.

[–] [email protected] 5 points 1 year ago (1 children)

Can you remove the GPU and use onboard?

It is mentioning gpu in the errors, so it would be the first thing I would try, to see if the errors change, because I have no idea what's going on here

[–] SuperSpruce 7 points 1 year ago

The computer is running a Ryzen 9 3900X, which does not have onboard graphics unfortunately.

[–] [email protected] 5 points 1 year ago

That monitor just keeps going

[–] [email protected] 4 points 1 year ago

Have you tried booting in with a live usb? You might be able to do some sort of recovery from there.

Having said, I’m still very much a Linux noob.

[–] [email protected] 3 points 1 year ago

Before doing anything, if your screen allows it, swap DP to HDMI or HDMI to DP as output, that may fix this to the point of being able to actually boot and further fix the issue.

I've had this before with drivers where suddenly it would fail on either port but would still run on one of the others.

[–] possiblylinux127 3 points 1 year ago* (last edited 1 year ago)

Can you please post the output of journalctl -xb

[–] [email protected] 2 points 1 year ago (1 children)

Try updating your DM / your entire system from the emergency mode.

[–] SuperSpruce 4 points 1 year ago (1 children)

How do I get into emergency mode?

[–] [email protected] 3 points 1 year ago (1 children)

The Shell where you typed "systemctl reboot" and "exit".

If you are running KDE neon, try "apt update" and "apt upgrade". If It doesn't work do "sudo apt update" and "sudo apt upgrade"

[–] SuperSpruce 4 points 1 year ago (2 children)

I ran apt update and some index files failed to download. It was just a warning though.

But systemctl reboot and exit still fail the same way.

[–] [email protected] 3 points 1 year ago

Before the "systemctl" command: try removing the GPU and booting it up without the GPU If it works, you can skip the "systemctl" commands

[–] [email protected] 3 points 1 year ago (5 children)

Run "journalctl --lines 200" and send photos of output.

NOTE: This is all of the logging of the computer, and it's long (that command select the last 200 entries), so you might have to scroll down using the PageDown key (or arrow down) in order make the photos of everything

[–] SuperSpruce 3 points 1 year ago (1 children)

[–] [email protected] 4 points 1 year ago* (last edited 1 year ago) (1 children)

The RAID1 seems to be failing according to that screenshot. That breaks the "Local File Systems" task and since quite a lot of things tend to depend on that, many things usually end up failing in an annoying cascade failure. It's also failing with a timeout instead of a strict error, which is odd.

Either way, I'd try commenting that line for /mnt/raid in /etc/fstab for now and seeing if that makes the system boot. It's possible that journalctl -u dev-md0.service or systemctl status dev-md0.service might tell you more, but it's 50/50 if it'll be anything useful.

[–] SuperSpruce 1 points 1 year ago (1 children)

How do I edit /etc/fstab if I'm not even able to boot the system? Or am I already booted in the system, just in a CLI environment?

[–] [email protected] 10 points 1 year ago (1 children)

You're most likely booted, otherwise you might need a live USB. Hopefully, the system isn't in read-only mode. What I'd recommend doing is:

cp /etc/fstab /etc/fstab.backup

To make a copy once. Then, nano /etc/fstab to run nano, a basic CLI editor. You can use the arrow keys to navigate and type freely in it. The hints like ^O shown on the bottom mean ctrl+o.

You'd use the arrow keys to go down to the line that probably says /dev/md0 /mnt/raid morecrap, put a # in front of it, press ctrl+w then enter to save. If that worked, ctrl+x to exit and try a reboot again.

Obviously can't promise this is "the" error preventing the system from booting, but it's generally a good idea to disable broken stuff like this to get the system working again, then fix it from there. Hopefully, this does the trick. Your RAID setup will not be activated on reboot after you do this but it's not going to permanently delete data or anything.

[–] SuperSpruce 2 points 1 year ago (1 children)

I used nano to edit /etc/fstab and commented out the last line and the system booted into GUI mode!

This leaves me with some questions:

Why does fstab fail to mount the NTFS raid array?
Why does the raid array failing to mount block the EDID signal? It's not like the OS lives on the raid array.
How do I properly mount the raid array and how do I automate it every boot if I can't use fstab?

[–] [email protected] 1 points 1 year ago

Looks like you need to look for messages about /dev/md0 and why it may be timing out. Also maybe add nofail to the raid entry in fstab so you can still boot if the root fs is not on it and it fails ( is root on NTFS possible or good?)

I don't think the edid message is a problem, just an artifact of your monitor not talking to your video card?

Maybe NTFS is the problem, I think it needs special options to automatically remove the dirty bit and replay the journal

[–] SuperSpruce 3 points 1 year ago

Note: The computer has an SSD where the OS lives and two HDDs, sda and sdb, set up in RAID 1 because the computer is 3.5 years old.

[–] SuperSpruce 2 points 1 year ago

[–] [email protected] 0 points 1 year ago

Read the messages on the screen. It's telling you how to check the logs for the error