centos – Unable to start kdump

2020-09-05 Original post

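My system keeps crashing, so I decided to enable kdump to investigate, because I cannot find any likely errors in the log files.

I followed the steps to set up kdump from a tutorial site. My server runs CentOS 5.8 with 16GB of RAM. These are the steps I performed to configure kdump: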

1. Install kexec-tools with `yum install kexec-tools` and follow the installation steps
2. Edit /boot/grub/grub.conf to configure the kdump memory reservation
3. Edit /etc/kdump.conf to set the dump target to /var/crash/ and configure core_collector
4. Enable kdump through `chkconfig kdump on`.
5. Reboot the server

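When I run `service kdump status`, it says Kdump is not operational.
What should I do to get kdump working? Did I miss something in the configuration?
I have included the contents of /boot/grub/grub.conf and /etc/kdump.conf below.

Below is the content of /boot/grub/grub.conf: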

# grub.conf generated by anaconda
#
# Note that you do not have to rerun grub after making changes to this file
# NOTICE:  You have a /boot partition.  This means that
#          all kernel and initrd paths are relative to /boot/,eg.
#          root (hd0,0)
#          kernel /vmlinuz-version ro root=/dev/sda3
#          initrd /initrd-version.img
#boot=/dev/sda
default=0
timeout=5
splashimage=(hd0,0)/grub/splash.xpm.gz
hiddenmenu
title CentOS (2.6.18-308.el5)
        root (hd0,0)
        kernel /vmlinuz-2.6.18-308.el5 ro root=LABEL=/
crashkernel=128M
        initrd /initrd-2.6.18-308.el5.img

Below is the content of /etc/kdump.conf:

# Configures where to put the kdump /proc/vmcore files
#
# This file contains a series of commands to perform (in order) when a
# kernel crash has happened and the kdump kernel has been loaded.  Directives in
# this file are only applicable to the kdump initramfs,and have no effect if
# the root filesystem is mounted and the normal init scripts are processed
#
# Currently only one dump target and path may be configured at once
# if the configured dump target fails,the default action will be preformed
# the default action may be configured with the default directive below.  If the
# configured dump target succedes
#
# For filesystem based dump,it's recommended to use UUID and LABEL
# instead of device name in dump target.
#
# See the kdump.conf(5) man page for details of configuration directives

#raw /dev/sda5
#ext3 /dev/sda3
#ext3 LABEL=/boot
#ext3 UUID=03138356-5e61-4ab3-b58e-27507ac41937
#net my.server.com:/export/tmp
#net user@my.server.com
path /var/crash
core_collector makedumpfile -c --message-level 1
#core_collector cp --sparse=always
#link_delay 60
#kdump_post /var/crash/scripts/kdump-post.sh
#extra_bins /usr/bin/lftp
#disk_timeout 30
#extra_modules gfs2
#options modulename options
#default shell
#sshkey /root/.ssh/kdump_id_rsa
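
Most of this file is commented out, so it helps to list only the directives that are actually in effect. A minimal sketch, assuming a POSIX shell with `grep -E`; the function name `active_directives` is hypothetical and not part of kexec-tools:

```shell
# Print only the active (non-comment, non-blank) lines of a kdump.conf-style file.
active_directives() {
    grep -Ev '^[[:space:]]*(#|$)' "$1"
}

# Typical use on the real file (path taken from the question):
# active_directives /etc/kdump.conf
```

For the file above, this leaves only the `path` and `core_collector` lines.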

I also noticed that my /boot/grub/grub.conf differs from the sample grub.conf in the tutorial. Two lines are different:

From the tutorial
   kernel /vmlinuz-2.6.32-220.el6.x86_64 ro root=/dev/sda3
   initrd /initramfs-2.6.32-220.el6.x86_64.img

From my own conf
   kernel /vmlinuz-2.6.18-308.el5 ro root=LABEL=/
   initrd /initrd-2.6.18-308.el5.img

Could these lines prevent kdump from starting?

[Edit 1]
Contents of /var/log/messages:

    Feb 25 02:18:28 61540 kernel: Command line: ro root=LABEL=/ crashkernel=128M
    Feb 25 02:18:28 61540 kernel: BIOS-provided physical RAM map:
    Feb 25 02:18:28 61540 kernel:  BIOS-e820: 0000000000010000 - 000000000009a000 (usable)
    Feb 25 02:18:28 61540 kernel:  BIOS-e820: 000000000009f800 - 00000000000a0000 (reserved)
    Feb 25 02:18:28 61540 kernel:  BIOS-e820: 00000000000f0000 - 0000000000100000 (reserved)
    Feb 25 02:18:28 61540 kernel:  BIOS-e820: 0000000000100000 - 00000000cfda0000 (usable)
    Feb 25 02:18:28 61540 kernel:  BIOS-e820: 00000000cfda0000 - 00000000cfdd1000 (ACPI NVS)
    Feb 25 02:18:28 61540 kernel:  BIOS-e820: 00000000cfdd1000 - 00000000cfe00000 (ACPI data)
    Feb 25 02:18:28 61540 kernel:  BIOS-e820: 00000000cfe00000 - 00000000cff00000 (reserved)
    Feb 25 02:18:28 61540 kernel:  BIOS-e820: 00000000e0000000 - 00000000f0000000 (reserved)
    Feb 25 02:18:28 61540 kernel:  BIOS-e820: 00000000fec00000 - 0000000100000000 (reserved)
    Feb 25 02:18:28 61540 kernel:  BIOS-e820: 0000000100000000 - 000000042f000000 (usable)
    Feb 25 02:18:28 61540 kernel: DMI 2.4 present.
    Feb 25 02:18:28 61540 kernel: No NUMA configuration found
    Feb 25 02:18:28 61540 kernel: Faking a node at 0000000000000000-000000042f000000
    Feb 25 02:18:28 61540 kernel: Bootmem setup node 0 0000000000000000-000000042f000000
    Feb 25 02:18:28 61540 kernel: Memory for crash kernel (0x0 to 0x0) notwithin permissible range
    Feb 25 02:18:28 61540 kernel: disabling kdump
    Feb 25 02:44:39 61540 kdump: No crashkernel parameter was specified or crashkernel memory reservation Failed
    Feb 25 02:44:39 61540 kdump: Failed to start up
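
The log above carries two separate facts: the `Command line:` entry shows what the kernel actually received, and the `Memory for crash kernel` entry shows what it managed to reserve. A minimal sketch for pulling the `crashkernel=` value out of a command line string, assuming a POSIX shell; the function name `crashkernel_value` is hypothetical:

```shell
# Print the value of the first crashkernel= parameter in a kernel command line.
# Prints nothing if the parameter is absent.
crashkernel_value() {
    printf '%s\n' "$1" | tr ' ' '\n' | sed -n 's/^crashkernel=//p' | head -n 1
}

# Example with the command line from the log above:
crashkernel_value 'ro root=LABEL=/ crashkernel=128M'
```

On a running system the same function can be given `"$(cat /proc/cmdline)"` to check what the booted kernel was passed.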

[Edit 2]
Or should I change `ro root=LABEL=/` to `ro root=/dev/sda3`?

title CentOS (2.6.18-308.el5)
        root (hd0,0)
        kernel /vmlinuz-2.6.18-308.el5 ro root=LABEL=/
crashkernel=128M
        initrd /initrd-2.6.18-308.el5.img

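It looks like you put the crashkernel parameter on a new line. That is why you get the "Kdump is not operational" message. All kernel parameters must be on the same line as the kernel: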

title CentOS (2.6.18-308.el5)
        root (hd0,0)
        kernel /vmlinuz-2.6.18-308.el5 ro root=LABEL=/ crashkernel=128M
        initrd /initrd-2.6.18-308.el5.img
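
A quick way to catch this kind of mistake is to look for `crashkernel=` on any line that is not a `kernel` line. A minimal sketch, assuming a legacy GRUB `grub.conf` laid out like the one above and a POSIX `awk`; `check_grub_crashkernel` is a hypothetical helper name:

```shell
# Report crashkernel= entries that are not part of a "kernel" line.
# Exit status is 0 when none are found and 1 otherwise.
check_grub_crashkernel() {
    awk '
        BEGIN { bad = 0 }
        $1 ~ /^#/ { next }                      # ignore comment lines
        /crashkernel=/ && $1 != "kernel" {      # parameter outside a kernel line
            printf "line %d: crashkernel= is not on a kernel line\n", NR
            bad = 1
        }
        END { exit bad }
    ' "$1"
}

# Typical use (path taken from the question):
# check_grub_crashkernel /boot/grub/grub.conf
```

This only checks where the parameter sits; it does not validate the value itself.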

After rebooting, look at /var/log/messages and you should see something like this:

localhost kdump: kexec: loaded kdump kernel
localhost kdump: started up

And:

# /etc/init.d/kdump start
Starting kdump:                                            [  OK  ]
# /etc/init.d/kdump status
Kdump is operational

kdump: No crashkernel parameter was specified or crashkernel memory reservation Failed
kdump: Failed to start up

According to this document, try this:

crashkernel=128M@16M
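
The `size@offset` form asks the kernel to reserve the given amount of memory starting at the given physical offset. A minimal sketch for applying such a value to every `kernel` line of a grub.conf, assuming a `sed` with POSIX character classes; `set_crashkernel` is a hypothetical helper, and it writes the result to standard output rather than editing the file, so the change can be reviewed first:

```shell
# Print a copy of the given grub.conf in which every "kernel" line carries
# exactly one crashkernel= parameter with the requested value.
#   $1 = path to grub.conf (or a copy), $2 = value such as 128M@16M
set_crashkernel() {
    sed -e '/^[[:space:]]*kernel[[:space:]]/{' \
        -e 's/[[:space:]]*crashkernel=[^[:space:]]*//g' \
        -e "s/\$/ crashkernel=$2/" \
        -e '}' "$1"
}

# Typical use (paths are examples; back up the original before replacing it):
# set_crashkernel /boot/grub/grub.conf 128M@16M > /tmp/grub.conf.new
```

Note that this does not repair a `crashkernel=` entry that sits on its own line; such a stray line has to be removed by hand.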
