Installing a Slurm 23.11.0 cluster on Debian 11.5

Purpose

Slurm (Simple Linux Utility for Resource Management, http://slurm.schedmd.com/) is an open-source, fault-tolerant, highly scalable resource manager and job scheduler for Linux clusters and supercomputing systems. A supercomputing system can use Slurm to manage resources and jobs so that jobs do not interfere with one another and overall efficiency improves. Every job, whether for debugging or production computation, can be submitted with the interactive parallel command srun, the batch command sbatch, or the allocation command salloc. After submission, job status can be queried with the related commands.

Architecture

Slurm uses the slurmctld daemon as its central manager, which monitors resources and jobs. A backup manager can be configured for higher availability. Each compute node runs the slurmd daemon, which works like a remote shell: it waits for a job, executes it, returns the status, and waits for more jobs.

slurmdbd (Slurm DataBase Daemon) is optional but recommended (accounting can also be written to plain text files). It can record the accounting information of several Slurm-managed clusters in a single database. slurmrestd (Slurm REST API Daemon) is also optional. It lets clients interact with Slurm through a REST API, and every function has a corresponding API.

The user tools are srun (run a job), scancel (terminate a queued or running job), sinfo (show system state), squeue (show job state), and sacct (show information about running or finished jobs and job steps). sview displays system and job state graphically and can include the network topology. scontrol is the administrative tool for monitoring and modifying cluster configuration and state. sacctmgr manages the database and is used to identify clusters, valid users, and valid accounting accounts.

Node roles and their Slurm packages:

    Node role                      Packages
    SlurmDBD Node                  slurm-smd, slurm-smd-slurmdbd
    Head Node (slurmctld node)     slurm-smd, slurm-smd-slurmctld
    Compute Nodes (slurmd node)    slurm-smd, slurm-smd-slurmd

Hosts used in this guide:

    192.168.86.134 slurm-head      # control node (Head Node)
    192.168.86.135 slurm-db        # database node (SlurmDBD Node)
    192.168.86.136 slurm-compute   # compute node (Compute Nodes)

Note: on an older server that already runs other services you can keep the existing hostname. Just substitute your own names wherever these names appear.

Set the hostnames

    # 192.168.86.134
    hostnamectl set-hostname slurm-head
    # 192.168.86.135
    hostnamectl set-hostname slurm-db
    # 192.168.86.136
    hostnamectl set-hostname slurm-compute

Edit /etc/hosts

    # 192.168.86.134 - 192.168.86.136
    echo "192.168.86.134 slurm-head
    192.168.86.135 slurm-db
    192.168.86.136 slurm-compute" >> /etc/hosts
    cat /etc/hosts

Change the Debian apt sources

The entries below use the bookworm suite. bookworm is the codename of Debian 12, while Debian 11 is bullseye, so use the codename that matches your installed release.

    # 192.168.86.134 - 192.168.86.136
    mv /etc/apt/sources.list{,.bak}
    echo "deb http://mirrors.tuna.tsinghua.edu.cn/debian/ bookworm main contrib non-free non-free-firmware
    # deb-src http://mirrors.tuna.tsinghua.edu.cn/debian/ bookworm main contrib non-free non-free-firmware
    deb http://mirrors.tuna.tsinghua.edu.cn/debian/ bookworm-updates main contrib non-free non-free-firmware
    # deb-src http://mirrors.tuna.tsinghua.edu.cn/debian/ bookworm-updates main contrib non-free non-free-firmware
    deb http://mirrors.tuna.tsinghua.edu.cn/debian/ bookworm-backports main contrib non-free non-free-firmware
    # deb-src http://mirrors.tuna.tsinghua.edu.cn/debian/ bookworm-backports main contrib non-free non-free-firmware
    deb https://security.debian.org/debian-security bookworm-security main contrib non-free non-free-firmware
    # deb-src https://security.debian.org/debian-security bookworm-security main contrib non-free non-free-firmware" > /etc/apt/sources.list
    apt update
    apt -y install vim wget

Synchronize the time

    # 192.168.86.134 - 192.168.86.136
    apt update
    apt install ntpdate -y
    ntpdate ntp1.aliyun.com

Passwordless SSH

    # 192.168.86.134 - 192.168.86.136
    cd
    ssh-keygen
    sed -i 's/#PermitRootLogin prohibit-password/PermitRootLogin yes/' /etc/ssh/sshd_config
    systemctl restart ssh
    passwd
    # 192.168.86.134
    ssh-copy-id slurm-head
    ssh-copy-id slurm-db
    ssh-copy-id slurm-compute

Install munge

Create the munge user and group with the same UID and GID on every node:

    # 192.168.86.134 - 192.168.86.136
    export MUNGEUSER=1120
    groupadd -g $MUNGEUSER munge
    useradd -m -c "MUNGE Uid 'N' Gid Emporium" -d /var/lib/munge -u $MUNGEUSER -g munge -s /sbin/nologin munge

3. Install the munge packages (together with the build dependencies Slurm needs later)

    # 192.168.86.134 - 192.168.86.136
    apt-get install -y munge libmunge-dev libmunge2 rng-tools make hwloc libhwloc-dev \
        git gcc build-essential fakeroot devscripts debhelper libncurses-dev \
        libgtk2.0-dev libpam0g-dev libperl-dev liblua5.3-dev dh-exec librrd-dev \
        libipmimonitoring-dev hdf5-helpers libfreeipmi-dev libhdf5-dev man2html \
        libcurl4-openssl-dev libpmix-dev libhttp-parser-dev libyaml-dev libjson-c-dev \
        libjwt-dev liblz4-dev libdbus-1-dev librdkafka-dev libreadline-dev perl
    # 192.168.86.135
    apt-get install mariadb-server libmariadb-dev-compat libmariadb-dev -y

4. Add the configuration file

Generate the munge key on one node and copy it to the other two. The scp targets show that the commands below run on slurm-head.

    rngd -r /dev/urandom
    dd if=/dev/urandom bs=1 count=1024 > /etc/munge/munge.key
    chown munge: /etc/munge/munge.key
    chmod 400 /etc/munge/munge.key
    chown -R munge: /var/lib/munge
    chown -R munge: /var/run/munge
    chown -R munge: /var/log/munge
    scp /etc/munge/munge.key root@slurm-db:/etc/munge/
    scp /etc/munge/munge.key root@slurm-compute:/etc/munge/

6. Start the service

    systemctl restart munge
    systemctl enable munge
    systemctl status munge

    # 192.168.86.135 - 192.168.86.136
    rngd -r /dev/urandom
    chmod 700 /etc/munge
    chown -R munge: /etc/munge
    chown -R munge: /var/lib/munge
    chown -R munge: /var/run/munge
    chown -R munge: /var/log/munge
    systemctl start munge
    systemctl enable munge
    systemctl status munge

Install slurm

    # Add the slurm user
    # 192.168.86.134 - 192.168.86.136
    groupadd slurm
    useradd -r -M -g slurm slurm

    ## Build and install from source
    # 192.168.86.134 - 192.168.86.136
    wget https://download.schedmd.com/slurm/slurm-23.11.0.tar.bz2
    tar -xf slurm-23.11.0.tar.bz2
    cd slurm-23.11.0/
    ./configure --enable-debug --prefix=/usr/local/slurm
    make
    make install

    # Check that the MySQL accounting plugin was built
    # 192.168.86.135
    ls /usr/local/slurm/lib/slurm | grep accounting_storag | grep mysql

    ## Start MariaDB, log in, and create the database
    systemctl start mariadb
    mysql -u root

In the mysql prompt, create the database once and then one user per connecting host:

    CREATE DATABASE slurm_acct_db;
    CREATE USER 'slurm'@'192.168.86.135';
    SET PASSWORD FOR 'slurm'@'192.168.86.135' = PASSWORD('mypassword');
    GRANT ALL PRIVILEGES ON slurm_acct_db.* TO 'slurm'@'192.168.86.135';
    CREATE USER 'slurm'@'192.168.86.134';
    SET PASSWORD FOR 'slurm'@'192.168.86.134' = PASSWORD('mypassword');
    GRANT ALL PRIVILEGES ON slurm_acct_db.* TO 'slurm'@'192.168.86.134';
    FLUSH PRIVILEGES;
    EXIT;

    ## Create the log directory and set its owner
    mkdir -p /var/log/slurm/
    chown slurm: /var/log/slurm/

Write the slurmdbd configuration file. It lives at /usr/local/slurm/etc/slurmdbd.conf, and slurmdbd.service must be present in /lib/systemd/system, so on slurm-db run the copy step from the next block (it applies to all three nodes) before editing the file and starting the service.

    ## Write the configuration file
    vim slurmdbd.conf

    AuthType=auth/munge
    DbdAddr=localhost
    DbdHost=localhost
    #DbdPort=7031
    SlurmUser=slurm
    MessageTimeout=300
    DebugLevel=debug5
    DefaultQOS=normal
    LogFile=/var/log/slurm/slurmdbd.log
    PidFile=/var/run/slurmdbd.pid
    StorageType=accounting_storage/mysql
    StorageHost=localhost
    StoragePort=3306
    StoragePass=mypassword
    StorageUser=slurm
    StorageLoc=slurm_acct_db

    ## Restrict the file permissions
    chown slurm: /usr/local/slurm/etc/slurmdbd.conf
    chmod 600 /usr/local/slurm/etc/slurmdbd.conf
    systemctl start slurmdbd
    systemctl status slurmdbd
    systemctl enable slurmdbd

Copy the example configuration and the systemd unit files (run from inside the slurm-23.11.0 source directory):

    # 192.168.86.134 - 192.168.86.136
    ## Control node or compute node
    cp -rf etc/ /usr/local/slurm/
    cp etc/slurm*.service /lib/systemd/system/
    cd /usr/local/slurm/etc
    cp slurm.conf.example slurm.conf
    cp cgroup.conf.example cgroup.conf
    cp slurmdbd.conf.example slurmdbd.conf

    ## Adjust the cgroup configuration
    echo "CgroupMountpoint=/sys/fs/cgroup" >> cgroup.conf
    cat cgroup.conf

    vim slurm.conf

    ClusterName=myCluster
    SlurmctldHost=slurm-head
    #SlurmctldHost=
    #
    MpiDefault=none
    ProctrackType=proctrack/cgroup
    ReturnToService=1
    SlurmctldPidFile=/var/run/slurmctld.pid
    SlurmctldPort=6817
    SlurmdPidFile=/var/run/slurmd.pid
    SlurmdPort=6818
    SlurmdSpoolDir=/var/spool/slurmd
    SlurmdUser=root
    StateSaveLocation=/var/spool/slurmctld
    SwitchType=switch/none
    TaskPlugin=task/affinity
    #TaskPlugin=task/cgroup
    #
    #
    # TIMERS
    InactiveLimit=0
    KillWait=30
    MinJobAge=300
    SlurmctldTimeout=120
    SlurmdTimeout=300
    Waittime=0
    # SCHEDULING
    SchedulerType=sched/backfill
    SelectType=select/cons_tres
    SelectTypeParameters=CR_Core_Memory
    #
    #
    # JOB PRIORITY
    AccountingStorageEnforce=qos,limits
    AccountingStorageHost=slurm-db
    AccountingStoragePass=/var/run/munge/munge.socket.2
    AccountingStorageType=accounting_storage/slurmdbd
    AccountingStorageUser=slurm
    #AccountingStorageTRES=gres/gpu
    JobCompHost=slurm-db
    JobCompLoc=slurm_acct_db
    JobCompPass=mypassword
    JobCompType=jobcomp/none
    JobCompUser=slurm
    JobAcctGatherFrequency=30
    JobAcctGatherType=jobacct_gather/linux
    SlurmctldDebug=info
    SlurmctldLogFile=/var/log/slurm/slurmctld.log
    SlurmdDebug=info
    SlurmdLogFile=/var/log/slurm/slurmd.log
    #GresTypes=gpu
    NodeName=slurm-head RealMemory=1935 State=UNKNOWN
    NodeName=slurm-db RealMemory=1935 State=UNKNOWN
    NodeName=slurm-compute RealMemory=1935 State=UNKNOWN
    PartitionName=compute Nodes=slurm-head,slurm-compute Default=YES MaxTime=168:00:00 State=UP

    ## Create the log and spool paths and set their owners
    mkdir -p /var/log/slurm/
    touch /var/log/slurm/slurmctld.log
    chown slurm: /var/log/slurm/slurmctld.log
    chmod u+rw /var/log/slurm/slurmctld.log
    touch /var/log/slurm/slurmd.log
    chown slurm: /var/log/slurm/slurmd.log
    chmod u+rw /var/log/slurm/slurmd.log
    mkdir -p /var/spool/slurmctld
    chown slurm: /var/spool/slurmctld
    mkdir -p /var/spool/slurmd
    chown slurm: /var/spool/slurmd
    chown slurm: /var/log/slurm/

    systemctl restart slurmctld
    systemctl status slurmctld
    systemctl restart slurmd
    systemctl status slurmd
    systemctl enable slurmctld
    systemctl enable slurmd

If something fails, run the command below that matches the failing service, then look up the error in the Pitfalls section.

    journalctl -xeu slurmd
    journalctl -xeu slurmctld
    journalctl -xeu slurmdbd

Demo

    [Screenshot: cluster status check]
    [Screenshot: job run check]
    [Screenshot: database write check (communication with slurmdbd), on the 192.168.86.135 machine]

Pitfalls

Couldn't find the specified plugin name for cgroup/v2 looking at all files

    ░░ The job identifier is 1747.
    Dec 07 06:28:09 slurm-head slurmd[59887]: slurmd: error: Couldn't find the specified plugin name for cgroup/v2 looking at all files
    Dec 07 06:28:09 slurm-head slurmd[59887]: slurmd: error: cannot find cgroup plugin for cgroup/v2
    Dec 07 06:28:09 slurm-head slurmd[59887]: slurmd: error: cannot create cgroup context for cgroup/v2
    Dec 07 06:28:09 slurm-head slurmd[59887]: slurmd: error: Unable to initialize cgroup plugin
    Dec 07 06:28:09 slurm-head slurmd[59887]: slurmd: error: slurmd initialization failed
    Dec 07 06:28:09 slurm-head systemd[1]: slurmd.service: Main process exited, code=exited, status=1/FAILURE
    ░░ Subject: Unit process exited

The cause is that /usr/local/slurm/lib/slurm/cgroup_v2.so is missing. Most of the advice found online for this error is of little use.

Fix

    # Run the full dependency install once more
    apt-get install -y munge libmunge-dev libmunge2 rng-tools make hwloc libhwloc-dev \
        git gcc build-essential fakeroot devscripts debhelper libncurses-dev \
        libgtk2.0-dev libpam0g-dev libperl-dev liblua5.3-dev dh-exec librrd-dev \
        libipmimonitoring-dev hdf5-helpers libfreeipmi-dev libhdf5-dev man2html \
        libcurl4-openssl-dev libpmix-dev libhttp-parser-dev libyaml-dev libjson-c-dev \
        libjwt-dev liblz4-dev libdbus-1-dev librdkafka-dev libreadline-dev perl
    # Then reconfigure and rebuild
    cd slurm-23.11.0/
    make uninstall
    make clean
    ./configure --enable-debug --prefix=/usr/local/slurm
    make
    make install

slurmdbd does not start

Once wrong database credentials have been ruled out, the cause is that mariadb/mysql was not installed before running make. It has to be installed on the build machine even if the database server does not run on the same host as slurmdbd. Otherwise the plugin /usr/local/slurm/lib/slurm/accounting_storage_mysql.so is not built.

error: cgroup namespace 'freezer' not mounted. aborting

This means the setup has drifted from the configuration in this guide. First apply the fix for "Couldn't find the specified plugin name for cgroup/v2 looking at all files", then set cgroup.conf as follows:

    #CgroupAutomount=yes
    CgroupMountpoint=/sys/fs/cgroup
    ConstrainCores=yes
    ConstrainDevices=yes
    ConstrainRAMSpace=yes
    ConstrainSwapSpace=yes
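Two of the pitfalls above come down to a plugin file that was never built (cgroup_v2.so and accounting_storage_mysql.so). A minimal sketch of a check that could be run right after make install, before starting any daemon, is shown here. The function name missing_plugins and the PLUGIN_DIR variable are illustrative names rather than part of Slurm, and the default directory assumes the --prefix used in this guide.

```shell
#!/bin/sh
# Print the name of every expected plugin file that is absent from a directory.
# Usage: missing_plugins DIR NAME...
# (missing_plugins is a hypothetical helper, not a Slurm command.)
missing_plugins() {
    dir=$1
    shift
    for name in "$@"; do
        [ -f "$dir/$name" ] || printf '%s\n' "$name"
    done
}

# Assumption: Slurm was installed with --prefix=/usr/local/slurm as in this guide.
PLUGIN_DIR=${PLUGIN_DIR:-/usr/local/slurm/lib/slurm}

# The two plugin file names are the ones named in the pitfalls.
missing=$(missing_plugins "$PLUGIN_DIR" cgroup_v2.so accounting_storage_mysql.so)

if [ -n "$missing" ]; then
    echo "missing plugins in $PLUGIN_DIR:"
    echo "$missing"
else
    echo "all expected plugins present in $PLUGIN_DIR"
fi
```

If a name is reported, install the development packages listed in the fix and rebuild. On a node that will not run slurmdbd, a missing accounting_storage_mysql.so may be acceptable.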