Bug #102

closed

Elwing is Borken

Added by Marc Dequènes almost 14 years ago. Updated over 13 years ago.

Status: Resolved
Priority: Normal
Category: System :: Hardware
Start date: 2010-06-14
Due date:
% Done: 100%
Estimated time:
Patch Available: No
Confirmed: No
Branch:
Entity: DuckCorp
Security: No
Help Needed:

Description

The master disk died.


Related issues 3 (0 open, 3 closed)

Related to DuckCorp Infrastructure - Enhancement #54: Remove emir and php4 on Elwing (Resolved, Marc Dequènes, 2010-04-11)
Related to DuckCorp Infrastructure - Bug #104: High io wait on Elwing (Rejected, Marc Dequènes, 2010-06-17)
Related to DuckCorp Infrastructure - Enhancement #117: Plans for the futur of HQ server/NAS/... (Resolved, Marc Dequènes, 2010-08-02)

Actions #1

Updated by Marc Dequènes almost 14 years ago

  • Status changed from New to In Progress
  • % Done changed from 0 to 20

Bought 2 new disks yesterday, and reinstalled the base system with basic networking services today. It uses RAID 1, like on Daneel, but with GPT (and GRUB2).
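
Since the new setup relies on RAID 1, a lost mirror member could be noticed early by reading /proc/mdstat. What follows is only a minimal sketch, assuming Linux software RAID (md), which this ticket does not state explicitly; the function name and the sample text are made up for illustration and are not taken from Elwing.

```python
import re


def degraded_arrays(mdstat_text):
    """Return the names of md arrays whose status line shows a missing member."""
    degraded = []
    current = None
    for line in mdstat_text.splitlines():
        # An array section starts with a line such as "md0 : active raid1 ...".
        header = re.match(r"^(md\d+)\s*:", line)
        if header:
            current = header.group(1)
            continue
        # The status line ends with e.g. "[2/2] [UU]"; "_" marks a missing member.
        status = re.search(r"\[(\d+)/(\d+)\]\s+\[([U_]+)\]", line)
        if current and status:
            if "_" in status.group(3):
                degraded.append(current)
            current = None
    return degraded


# Hypothetical sample in the usual /proc/mdstat layout (not real data).
SAMPLE = """\
Personalities : [raid1]
md0 : active raid1 sdb1[1] sda1[0]
      524224 blocks super 1.2 [2/2] [UU]

md1 : active raid1 sda2[0]
      976630336 blocks super 1.2 [2/1] [U_]

unused devices: <none>
"""

if __name__ == "__main__":
    print(degraded_arrays(SAMPLE))
```

On a real machine the text would come from reading /proc/mdstat instead of the sample string.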

Actions #2

Updated by Marc Dequènes almost 14 years ago

The WD disks in the "desktop" class do not work well in RAID, as WD willingly made this impracticable (thieves!). I tried to improve things by using the wdidle3 utility to get this silly "intellipark" kikoolol feature out of the way, and discovered that one of them is broken, so I returned it.

I plan to buy new disks and reuse or sell the former ones. Interesting items:

Actions #3

Updated by Marc Dequènes almost 14 years ago

  • % Done changed from 20 to 50

Many services were reinstalled.

Actions #4

Updated by Marc Dequènes almost 14 years ago

  • Priority changed from Urgent to Normal
  • % Done changed from 50 to 90

Almost everything is reinstalled. I need to:
  • check if Cacti stats are all working nicely
  • recreate the live-helper network image

Then I need to double-check that nothing was forgotten.

Downgrading the severity, as most of the job is done and the important services have been working well for a few weeks already.

Actions #5

Updated by Marc Dequènes over 13 years ago

Cacti broken stats:
  • Daneel
    • interfaces: eth0 (at least traffic, always 0)
  • Elwing
    • CPU
    • Apache statistics (all)
    • interfaces: ppp0 / tun0
    • LDAP
    • mysql innodb: transactions / lock waits / memory usage / lock structures / semaphores waits / semaphores wait time / tables in use
    • mysql relay logs
    • mysql replication (slave lag)
  • Gwaihir
    • space on /jffs
    • processes (not nan anymore, but always 0 after graph+rra reconstruction)
  • Orfeo
    • LDAP
    • postgresql
  • Toushirou
    • LDAP
    • mysql (similar to Elwing)
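
The two failure modes mentioned in this list (values that are nan, and values that are always 0) can be told apart mechanically once the samples of a graph are exported. This is only a rough sketch, assuming the samples are available as a plain list of numbers; the function name is hypothetical and nothing here uses a Cacti or RRDtool API.

```python
import math


def classify_series(samples):
    """Classify exported graph samples as 'nan', 'always zero', or 'ok'."""
    # Keep only real measurements; None and NaN both mean "no data".
    finite = [s for s in samples if s is not None and not math.isnan(s)]
    if not finite:
        return "nan"
    if all(s == 0 for s in finite):
        return "always zero"
    return "ok"


if __name__ == "__main__":
    print(classify_series([float("nan"), float("nan")]))
    print(classify_series([0, 0.0, float("nan")]))
    print(classify_series([0, 12.5, 3]))
```
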
Actions #6

Updated by Marc Dequènes over 13 years ago

The returned drive is still being processed; the vendor confirmed this to me by phone.

Actions #7

Updated by Marc Dequènes over 13 years ago

  • % Done changed from 90 to 50

I need to redo the work on Elwing-NG.

Services transferred:
  • basic network services (DHCP, DNS, radvd, ...)
  • LDAP

The NAS services are on the way, but moving files is a long process.

Actions #8

Updated by Marc Dequènes over 13 years ago

  • % Done changed from 50 to 90

All services should be all right.

PXE is tested OK. Webstats work again. Only Cacti stats should be rechecked.

Let's have another look in a few days, to ensure nothing was forgotten.

Elwing's system behaves quite well.

Actions #9

Updated by Marc Dequènes over 13 years ago

  • Status changed from In Progress to Resolved
  • % Done changed from 90 to 100

Cacti stats have been partly repaired. Most remaining problems are related to #71 and will be addressed in that ticket.

The old disk is kept just in case, and this ticket is now finished. Plans for the two WD drives will be addressed in another ticket. The broken disk exchange is taken care of by #136.

Actions #10

Updated by Marc Dequènes over 13 years ago

  • Category changed from System :: Base to System :: Hardware