Project

General

Profile

Actions

Enhancement #46

closed

Major Backup Rework

Added by Marc Dequènes about 14 years ago. Updated almost 13 years ago.

Status:
Resolved
Priority:
High
Category:
Service :: Backup
Start date:
2010-04-11
Due date:
% Done:

100%

Estimated time:
Patch Available:
No
Confirmed:
No
Branch:
Entity:
DuckCorp
Security:
No
Help Needed:

Description

Tasks :
  • switch to bacula 5 and use interresting new features
  • rework config with includes
  • recalculate schedules and timeouts with current amount of data
  • ensure Elwing and Toushirou data are no more a problem

Related issues 1 (0 open1 closed)

Blocked by DuckCorp Infrastructure - Enhancement #217: improve sendfile supportResolvedMarc Dequènes2011-05-08

Actions
Actions #1

Updated by Marc Dequènes almost 14 years ago

  • Priority changed from High to Urgent
  • Security set to No

We are low on space in the backup partition...

Actions #2

Updated by Marc Dequènes almost 14 years ago

  • Priority changed from Urgent to Immediate

Recent backup just blocked, as we lack space, so the config rework must happen soon...

Actions #3

Updated by Marc Dequènes almost 14 years ago

Bacula 5.0.2 is now in unstable !

Actions #4

Updated by Marc Dequènes almost 14 years ago

  • Status changed from New to In Progress
  • % Done changed from 0 to 30

Bacula 5.0.2 installed, using a few new features.

We should also take care of the CriticalDataBackup run scripts, which are deactivated, it could be useful.

Actions #5

Updated by Marc Dequènes almost 14 years ago

  • Priority changed from Immediate to High

A few parameters have been changed to improve the situation, but still we need to work more on the subject.

Downgrading the severity as the situation is sustainable.

Actions #6

Updated by Marc Dequènes over 13 years ago

  • % Done changed from 30 to 40

Configuration was split properly.

The Storage directive was moved from Jobdefs to Pools, to later allow Copy/Migration to remote Pools.

Actions #7

Updated by Marc Dequènes over 13 years ago

We need to check directory listing, in order to not forget anything and revolve the following minor issues:

03-Dec 06:40 Elwing-fd JobId 8609:      Could not stat "/var/lib/ejabberd": ERR=No such file or directory
03-Dec 06:40 Elwing-fd JobId 8609:      Could not stat "/var/lib/tftpboot": ERR=No such file or directory
03-Dec 06:40 Elwing-fd JobId 8609:      Could not stat "/www": ERR=No such file or directory
03-Dec 06:40 Elwing-fd JobId 8609:      Could not stat "/data/mldonkey/*.ini": ERR=No such file or directory
03-Dec 08:21 Elwing-fd JobId 8609:      Could not stat "/usr/share/rbot/plugins": ERR=No such file or directory
03-Dec 08:40 Orfeo-fd JobId 8610:      Could not stat "/var/lib/bitlbee": ERR=No such file or directory
03-Dec 23:18 Daneel-dir JobId 8612: Fatal error: Job canceled because max start delay time exceeded.
03-Dec 23:18 Daneel-dir JobId 8613: Fatal error: Job canceled because max start delay time exceeded.
04-Dec 03:30 Elwing-fd JobId 8620: Error: /var/lib/cacti/rra/toushirou_via_hivane_threads_connected_367.rrd mtime changed during backup.
04-Dec 03:30 Elwing-fd JobId 8620:      Could not stat "/var/lib/ejabberd": ERR=No such file or directory
04-Dec 03:30 Elwing-fd JobId 8620:      Could not stat "/var/lib/tftpboot": ERR=No such file or directory
04-Dec 03:30 Elwing-fd JobId 8620:      Could not stat "/www": ERR=No such file or directory
04-Dec 03:30 Elwing-fd JobId 8620:      Could not stat "/data/mldonkey/*.ini": ERR=No such file or directory
04-Dec 03:30 Elwing-fd JobId 8620:      Could not stat "/usr/share/rbot/plugins": ERR=No such file or directory
04-Dec 03:30 Orfeo-fd JobId 8621:      Could not stat "/var/lib/bitlbee": ERR=No such file or directory

Actions #8

Updated by Marc Dequènes over 13 years ago

  • % Done changed from 40 to 50

Fixed the previous list, but still need to check for missing parts.

Actions #9

Updated by Marc Dequènes almost 13 years ago

  • % Done changed from 50 to 60
Today, fixed:
  • missing db-backup group/user on Elwing
  • remaining /var/lib/postgresql on Toushirou (script then detected a PG server, which was wrong)
Actions #10

Updated by Marc Dequènes almost 13 years ago

As having another backup host is not happening anytime soon, and the need for backup space keeps growing over time, i suggest the following course of action:
  • find a way to split the PV on Daneel, one for the system and another for backup data
  • keeps using soft RAID1 for the first PV, but use both backup partitions as PV in the same VG, to double the size of backup data space
    These are big changes, and i don't know how to do that without reinstalling from scratch.
Actions #11

Updated by Marc Dequènes almost 13 years ago

  • % Done changed from 60 to 90

Changes done, recipe added as tip in the wiki, now watching over it.

Actions #12

Updated by Marc Dequènes almost 13 years ago

  • Status changed from In Progress to Resolved
  • % Done changed from 90 to 100
Actions

Also available in: Atom PDF