Backup corrupted

Hi Everyone,

I’ve been using Duplicati for several years to back up ~200K files totalling ~100GB to Google Drive, with a “7D:1D,4W:1W,12M:1M,100Y:1Y” retention policy (i.e. frequent, daily backups). I am using a dedicated Google account for the backup, so only Duplicati can modify the remote files.

Being a conscientious guy, I regularly do a DRP (disaster recovery) test, once every 3 months.
My DRP test consists of (1) reinstalling Duplicati, (2) rebuilding the database from the cloud backup storage, and (3) restoring the files.
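
For concreteness, steps (2) and (3) could be scripted roughly like this. This is only a sketch based on my understanding of the Duplicati.CommandLine interface; the backend URL, passphrase and paths are placeholders, and the option names should be double-checked against the documentation for the version in use:

```python
# A sketch of DRP steps (2) and (3), driven via Duplicati.CommandLine.
# URL, passphrase and paths are placeholders; command and option names
# reflect my understanding of the CLI, so verify against the docs.
import subprocess

BACKEND_URL = "googledrive://duplicati-backup?authid=..."  # placeholder
PASSPHRASE = "..."                                         # placeholder
DB_PATH = "rebuilt-database.sqlite"                        # fresh local DB
RESTORE_DIR = "drp-restore"                                # scratch folder

# Step (2): rebuild the local database purely from the remote storage.
subprocess.run(
    ["Duplicati.CommandLine.exe", "repair", BACKEND_URL,
     f"--dbpath={DB_PATH}", f"--passphrase={PASSPHRASE}"],
    check=True)

# Step (3): restore everything from the latest version into a scratch folder.
subprocess.run(
    ["Duplicati.CommandLine.exe", "restore", BACKEND_URL, "*",
     f"--dbpath={DB_PATH}", f"--restore-path={RESTORE_DIR}",
     f"--passphrase={PASSPHRASE}"],
    check=True)
```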

Most of the time it succeeds. Unfortunately, roughly once a year it fails at step (2) because the database cannot be rebuilt from the cloud storage (e.g. Error-Duplicati.Library.Main.Operation.RecreateDatabaseHandler-MissingFileDetected).

I typically work around this by deleting the remote storage and starting a new backup from scratch, which is obviously not ideal (I expect to always be able to rebuild the database and restore from the remote storage).

I’d like to work together with Duplicati’s dev team to solve this problem for good.

Right now I have a “corrupt” remote backup which we can use for troubleshooting. What steps can I take to work with the development team to identify the root cause and solve it?

I’m a computer science engineer with a software dev background, so I can manage detailed technical instructions.

Thanks,

Gaston

It looks like the problem was that my backup was initially created with 2.0.6.3 beta, but I tried to rebuild the database using the latest canary version (2.0.9.108).

After retrying with 2.0.8.1 beta, the database could be successfully rebuilt.

I’ll try to restore the data to confirm everything works, then I’ll do some more “divide and conquer” and report here.

For reference, the error I got when trying to rebuild the database on 2.0.9.108 was:

Oct 20, 2024 12:39 AM: Failed while executing Repair "Gaston's profile (Cloud)" (id: 1)
code = Error (1), message = System.Data.SQLite.SQLiteException (0x800007BF): SQL logic error
no such column: Temporary
   at System.Data.SQLite.SQLite3.Prepare(SQLiteConnection cnn, String strSql, SQLiteStatement previous, UInt32 timeoutMS, String& strRemain)
   at System.Data.SQLite.SQLiteCommand.BuildNextCommand()
   at System.Data.SQLite.SQLiteDataReader.NextResult()
   at System.Data.SQLite.SQLiteDataReader..ctor(SQLiteCommand cmd, CommandBehavior behave)
   at System.Data.SQLite.SQLiteCommand.ExecuteNonQuery(CommandBehavior behavior)
   at Duplicati.Library.Main.Database.ExtensionMethods.ExecuteNonQuery(IDbCommand self, Boolean writeLog, String cmd, Object[] values)
   at Duplicati.Library.Main.Database.LocalRecreateDatabase.CleanupMissingVolumes()
   at Duplicati.Library.Main.Operation.RecreateDatabaseHandler.DoRun(LocalDatabase dbparent, Boolean updating, IFilter filter, NumberedFilterFilelistDelegate filelistfilter, BlockVolumePostProcessor blockprocessor)
   at Duplicati.Library.Main.Operation.RecreateDatabaseHandler.Run(String path, IFilter filter, NumberedFilterFilelistDelegate filelistfilter, BlockVolumePostProcessor blockprocessor)
   at Duplicati.Library.Main.Operation.RepairHandler.RunRepairLocal(IFilter filter)
   at Duplicati.Library.Main.Operation.RepairHandler.Run(IFilter filter)
   at Duplicati.Library.Main.Controller.<>c__DisplayClass16_0.<Repair>b__0(RepairResults result)
   at Duplicati.Library.Main.Controller.RunAction[T](T result, String[]& paths, IFilter& filter, Action`1 method)
   at Duplicati.Library.Main.Controller.RunAction[T](T result, IFilter& filter, Action`1 method)
   at Duplicati.Library.Main.Controller.Repair(IFilter filter)
   at Duplicati.Server.Runner.Run(IRunnerData data, Boolean fromQueue)

The Oct 8 fix will presumably be out in the next canary after the current v2.0.9.108_canary_2024-10-03. Canary releases are by definition new and untested, so beware of bugs; help with testing is appreciated.

Tracking down your annual issue, if it’s on 2.0.6.3, might just show that it has since been fixed, but working with something more current may be useful. It may involve lots of attempts to preserve historical info.

Ideally, what one wants is probably a GitHub issue with exact steps for anyone to reproduce the failure. A pull request would be even better, but heading the other way, one can at least collect data for the devs.

I’m happy to offer suggestions; I collect a lot of data myself and try to avoid latent-problem situations. Sometimes this means test scripting that does a more intensive check than you (or Duplicati) currently do.

That exact error is created here:

The logic here is that it attempts to build a complete database that you can continue your backup from. A failure here does not mean that you cannot restore the data in full, but obviously it should just work.

The logic around the error indicates that one of the index files is read, and this dindex file references a dblock file that is no longer present. Usually this gets sorted out further down, where it removes all missing files that are not needed. I think you see a different exception in the end?

That would be awesome!

When it fails, it should give the full name of the dblock file that is missing. If you have the original SQLite database, it should be possible to search that database and see when the file was deleted. This would be in the RemoteOperation table. If you can find the file there, it should be possible to backtrack and see what situation caused the file to be deleted.
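
One quick way to do that lookup is a small script like the one below. This is just a sketch; the table and column names (RemoteOperation, Timestamp, Operation, Path) are from memory of the schema, so verify them against your actual database:

```python
# Sketch: list every remote operation recorded for the missing dblock file in
# the original local database. Table/column names are from memory of the
# Duplicati schema, so verify them first.
import sqlite3

DB_PATH = "path/to/original-duplicati-database.sqlite"   # placeholder
MISSING = "duplicati-b....dblock.zip.aes"                # name from the error message

con = sqlite3.connect(DB_PATH)
for ts, op, path in con.execute(
        "SELECT Timestamp, Operation, Path FROM RemoteOperation "
        "WHERE Path = ? ORDER BY Timestamp", (MISSING,)):
    # Timestamp is likely Unix epoch seconds; Operation is e.g. put/get/delete.
    print(ts, op, path)
con.close()
```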

I assume that the deletion was intentional from Duplicati as it would otherwise give an error message when listing files prior to running a new backup.

My guess is that this happens during compacting, and perhaps the order of the remote file deletion causes the dindex files to wrongly list a file that will soon be deleted. We do have a bunch of tests around compaction, but this could be something that requires a special setup to be triggered.

As mentioned by @ts678, this was discovered and has already been fixed; a new canary build should include the fix.


The missing file errors you saw previously (not this instance) might be caused by this:

The issue was discussed here:

Essentially it is caused by an interrupted file upload, which leaves the database in an incorrect state. The next backup recovers from the interruption, but leaves the index file with a broken reference.


Hi @ts678, @kenkendk, @Jojo-1000,

Thank you All for your replies.

Indeed, I’ve been working on an old version (2.0.6.3) for quite some time.

I suspect Jojo’s answer is relevant: I have a daily automatic backup running in the background, so it’s well within the realm of possibility that I’ve had interrupted backups in the past (e.g. computer shutdown, lost network connection, etc.).

Unfortunately, I no longer have the original DB: I was on 2.0.6.3 for quite a while, upgraded directly to 2.0.9.108, and, because of the “Temporary” column error, downgraded to 2.0.8.1 and decided to rebuild the DB (I figured, perhaps incorrectly, that 2.0.8.1 would not work on a DB upgraded by 2.0.9.108).

Now, my current DB was successfully rebuilt on 2.0.8.1.

But when doing my DRP test directly from the backend storage, I am encountering a MissingFileDetected error. Nevertheless, my latest backup restores just fine (no errors, and when I compare the restored files with the source data using WinMerge, everything matches).

The exact error during the DB repair phase was:
2024-10-26 17:59:54 +09 - [Error-Duplicati.Library.Main.Operation.RecreateDatabaseHandler-MissingFileDetected]: Remote file referenced as duplicati-b6fe6e2f12bb8478c942f692a6f8287d8.dblock.zip.aes by duplicati-i0fbd36f50cd34f878178316fc62f21e8.dindex.zip.aes, but not found in list, registering a missing remote file

When I check the backend storage:
duplicati-b6fe6e2f12bb8478c942f692a6f8287d8.dblock.zip.aes does not exist
duplicati-i0fbd36f50cd34f878178316fc62f21e8.dindex.zip.aes does exist, modified date was 2024-03-16.

Indeed, back in March I was running the old version (2.0.6.3), which did not include the fix mentioned by @Jojo-1000.

I understand that the version I am running now (2.0.8.1) includes the fix for the cause of this problem, but old data on the backend would still be impacted.

Two questions:

  1. Is there a way to confirm that none of my valid backups (the ones kept by the retention policy) is impacted by this missing dblock? Hopefully something more efficient than test-restoring ~100+GB of data from the cloud 18 times 🙂 (see the sketch below this list).
  2. How can I “repair” such inconsistencies in my database and backend data?
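
For question 1, I was wondering whether something like the following, run against the rebuilt local database, would already answer it. This is only a sketch based on my (possibly wrong) reading of the schema, where Block.VolumeID points at the Remotevolume row of the dblock that stores the block:

```python
# Sketch: count how many blocks in the rebuilt local database are recorded as
# stored in the missing dblock volume. Schema assumptions: Block.VolumeID
# references Remotevolume.ID, and Remotevolume.Name is the remote file name.
import sqlite3

DB_PATH = "path/to/rebuilt-database.sqlite"              # placeholder
MISSING = "duplicati-b6fe6e2f12bb8478c942f692a6f8287d8.dblock.zip.aes"

con = sqlite3.connect(DB_PATH)
(count,) = con.execute(
    "SELECT COUNT(*) FROM Block "
    "JOIN Remotevolume ON Block.VolumeID = Remotevolume.ID "
    "WHERE Remotevolume.Name = ?", (MISSING,)).fetchone()
con.close()

# 0 would suggest no retained version still needs data from that dblock;
# anything else would call for tracing which files/versions reference it.
print(count)
```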

Thanks in advance!

Gaston

The fix for this issue is not yet in any released version, probably because there has been no time to review it. Fortunately, it should be possible to recover from this manually, and it doesn’t lose any data.

You can rename the index file with the missing reference so that it is not recognized (for example, duplicati-*** to _duplicati-***). Then a new database recreate should work without errors. If there is missing data, it should be reported during that recreate as well.

If renaming the extra index file doesn’t help, you have a different kind of error, so keep the file for more troubleshooting.

Hi @Jojo-1000

Thank you very much.

I spent some time reading how Duplicati works:

Very interesting reading.

If I understand correctly, the purpose of the dindex files is to know which hashes are stored in which dblock files.

I manually decrypted a few dindex files and looked inside them, and it seems there is a 1:1 mapping between dindex and dblock files (each dindex file references exactly one dblock file). Therefore, if I delete the dindex file in the error message, it should normally only impact the dblock file (which is missing anyway). In other words, there is no risk of breaking anything other than what is already broken. Is my understanding correct?
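
For reference, this is roughly how I inspected them: once a dindex file is decrypted, it is a plain zip, and in the ones I opened each referenced dblock shows up as an entry under vol/ named after the dblock file. That is just my observation from a handful of files, not a guarantee about the format:

```python
# Sketch: print which dblock files an already-decrypted dindex zip references,
# based on my observation that each referenced dblock appears as a "vol/" entry
# named after the dblock file. This may not hold for every version or setting.
import zipfile

DINDEX = "duplicati-i0fbd36f50cd34f878178316fc62f21e8.dindex.zip"  # decrypted copy

with zipfile.ZipFile(DINDEX) as z:
    referenced = [name[len("vol/"):]
                  for name in z.namelist() if name.startswith("vol/")]

print(referenced)  # in my case: just the one missing dblock file
```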

Further, let’s say (not the case here, just for my knowledge) that what was lost is a dindex file, but not its referenced dblock file. In that case, does Duplicati automatically rebuild the dindex file from the dblock file? And if so, considering that the file names of dblock and dindex files are different, how does Duplicati know which dblock (out of thousands of files) to download in order to rebuild the dindex file? I’m guessing it uses the local DB to know which dblocks are missing their dindex file?

Sorry for the n00b questions. I find this topic very interesting, so I’m taking the opportunity to learn about it 🙂

Cheers,

Gaston

From the original post of the issue, here was my guess; I didn’t read enough to see if it’s wrong:

Regardless, the extra dindex file is harmless until the dblock and the new dindex from the fix get deleted by a compact. After that, there’s an old dindex pointing to a dblock that’s no longer there. Before that, the recreate was presumably happy because it found two dindex files, but both pointed to the same dblock and held the same information, so no harm done. Just a guess.

Specific cases may vary anyway, so the conservative path is to not delete files early and regret it later.

What I like to see on a Recreate is the progress bar not getting past 70%; past 90% it can be very slow.

This range is downloading dblocks, and a great view is in About → Show log → Live → Verbose.

If the recreate problem came up within the last 30 days (at the default log-retention), the database has the history to show a deletion of duplicati-b6fe6e2f12bb8478c942f692a6f8287d8.dblock.zip.aes; unfortunately, a recreate only every 3 months might have missed that window. You could try an SQLite viewer on the RemoteOperation table’s Path column to see if it appears. DB Browser for SQLite can filter a column.

Probably, but I don’t like to talk in certainties when the situation is uncertain and still being investigated.

I think that’s right, IIRC. Maybe the Remotevolume table’s ID column and the IndexBlockLink table for a start.
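
Roughly this sort of query, assuming the schema is as I remember it (Remotevolume.Type distinguishes “Blocks” from “Index” files, and IndexBlockLink maps IndexVolumeID to BlockVolumeID), would show dblock volumes the database thinks have no dindex:

```python
# Sketch: list dblock volumes with no linked dindex, per my recollection of the
# schema (verify table/column names before relying on this). You may also want
# to filter on the State column to skip volumes already marked as deleted.
import sqlite3

con = sqlite3.connect("path/to/duplicati-database.sqlite")  # placeholder
rows = con.execute(
    "SELECT Name FROM Remotevolume "
    "WHERE Type = 'Blocks' "
    "AND ID NOT IN (SELECT BlockVolumeID FROM IndexBlockLink)").fetchall()
con.close()

for (name,) in rows:
    print(name)   # dblock files the database believes have no dindex
```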

This is the default case, but it can get out of whack, and there doesn’t seem to be any advance test for it.

Possibly this is because (I think) dindex files are not technically required; for example, there’s an option:

  --index-file-policy (Enumeration): Determine usage of index files
    The index files are used to limit the need for downloading dblock files when there is no local database present. The more information is recorded in the index files, the faster operations can proceed
    without the database. The tradeoff is that larger index files take up more remote space and which may never be used.
    * values: None, Lookup, Full
    * default value: Full

but it’s annoying, because if you lose a dindex file and then recreate the database, you end up in a forever-slow-recreate situation: it won’t make a dindex for the dblock, even though it has enough information. I think one time in a test I edited the database to trick it into recreating the dindex.

Thanks @ts678 for the additional info.

For this particular case, since I downloaded the dindex file, decrypted it, and confirmed that it only references one dblock file (the missing one), I guess I am “safe”.

So I removed it from the backup storage (keeping a copy just in case) and then rebuilt the DB. This time it rebuilt without any errors.

I also did a direct restore from backup files (the DRP test I mentioned earlier). That also worked flawlessly.

So that solved the issue! Thanks, everyone, for your valuable input.

Best regards,

Gaston
