Some common categories:
(1) Financial: account statements (pdf, paper, etc), transaction CSV/ofx/etc, insurance policies, financial institution correspondence (confirmations, T&Cs, etc), tax forms, deeds/titles, loans, debts, contemporaneous records, paystubs, employment letters, receipts, etc
(2) Health: test results, diagnoses, medications used, doctor info/correspondence, vaccinations, hospital interactions, statements, bills, etc
(3) Product and services info: products purchased, maintenance info, warranties, replacement parts, recall notices, class action events, professional used (CPAs, lawyers, contractors, plumbers, etc), etc
(4) Personal legal and other documents: wills, trusts, health directives, pre-nups, legal settlements, personal contracts, rental agreements, government IDs, immigration/naturalization docs, law enforcement interactions, etc
(5) Memories and other info of sentimental or other value: pictures, videos, cards, recordings, music, diaries, clippings, personal notes, etc
For many of these one might ask the usual questions:
A. do I have a reason/obligation to retain it? for example, tax authorities often require certain classes of record keeping. sometimes commercial entities make false claims in retrospect that you can only rebut with evidence (even major banks!). some documents can end up only being used by heirs at end-of-life (ex: basis information for assets like a home, which can require proof of improvements over the years, depreciation claimed if the home was ever rented, etc)
B. is there a downside to retaining it? clutter, hard/costly to move, identity/other theft risk, physical decay over time (ex: obsolete storage formats (VHS, mini-DV, blueray)), etc
C. will I (or the appropriate person) be able to find it when I need it? physical paper vs electronic mgmt, indexing, filing, passwords, legacy services for transferring access.
D. can I actually retain it? backups, water damage, fire, theft, physical safe management, etc some online services don't provide any/easy export of data, many have inconsistent retention policies (ex utility companies)
Some of the categories above have specialized software and services dedicated to them (ex: photos services, quickbooks, etc) while some are more ad hoc (goog drive, dropbox, local HDD); even specialized services often fall short on key matters (quickbooks doesn't track statements afaik); many specialized systems have various platform limitations (no turbotax on linux, say); many vendors try to play various lock-in tricks; some vendors go out of business or discontinue products over the years; some institutions are very paper oriented so one perhaps scans or tries to limit paper delivery; some services place data in the cloud creating security exposure; etc etc
Everything is stored locally, unencrypted, and backed up on a local removable HDD, and on BackBlaze. BackBlaze has saved my bacon more than once.
As for physical objects, they're all temporarily mine, until I pass on. They're either tools, consumables, or tied to personal memories. Everything else can go.
Photos (hundreds of GBs) are also in Subversion hosted on same server. Photos are managed/viewed either via Digikam (on both Mac, Linux, Windows) as well as through a generated HTML/js gallery served on the same server.
Home server is backed up via restic to
* local harddrives that are rotated weekly and moved offsite. * to a rented physical Debian server in a different country * to a cloud storage provider
Server storage is fully encrypted using LUKS. Backups are encrypted via restic/ssh.
Various sensitive documents are separately encrypted inside the svn repos via 'age' or similar tools.
This setup has been working for almost two decades. I have yet to lose any data.
Obligation wise it's good to keep all documents for whatever the longest record keeping period is in your region.
Important that you scan your identity documents and keep copies too. That has been incredibly useful when I'm abroad because a lot of people will retain passports etc if you go in a hospital or something so you can just show them the electronic copy and tell them you left it in the hotel.
Backups via restic end up on backblaze. Haven’t gotten around to set up the local backup. This is a good reminder to do so. :)
The photos end up in iCloud and Adobe with a backup in backblaze. Not so happy yet with this solution. But most alternatives I tried weren’t as comfortable.
Edit: The paperless ngx server is a low power machine that runs in my local network.
Important stuff has a photo taken, and it's likely backed up in my photo archives (Syncthing → Pi Storage → NAS backup).
My girlfriend still uses google and whatsapp, and I tend to send her important media in our chats, so we'd both have to be kicked out of our gmail and whatsapp accounts to truly lose everything there.
I used to keep photos and videos on an external hard disk with a jwz-style backup mechanism and Lightroom 3 for organizing it (kept the light room database file on the hard disk itself)
Now just gave up and use iCloud Photos. Not as good as Lightroom but can't find anything better that's also easy.
Secure details go into 1Password with a printout of emergency keys in a wardrobe under lock.
I used to use Evernote but moved away from it soon after they started introducing chat and a bunch of crap no one cared about.
I keep copies of often-referenced items from Dropbox in Apple Notes but the master copy is a file in Dropbox.
I'm using the Shortcuts app on the phone to take a receipt from an online payment and file it away in the /receipts folder in Dropbox.
I used to have an IFTTT integration to take all the photos I posted on Facebook and keep a copy of them in a Dropbox folder as well. It probably still works but I don't really post stuff anywhere any more.
For digital stuff it’s mostly the same idea. If it’s email (which is almost always the case), I archive and forget about it. If it’s something else, I’ll throw it in my virtual “messy drawer” which is just a folder in Google Drive.
You are getting quite a few clever ideas in this thread, but I suspect the vast majority of people out there in the real world do more or less the same as I do (of course, I’m biased).
My “philosophy” (overstatement) is that 99.99% of the time I’ll never need it. If I ever do, it might be a little painful to dig around, but that doesn’t justify having a whole system in place. I would rather keep my mess.
There's always a bit too much that is attached to emails, I try to remember to save stuff out.
A Synology NAS in the attic which backs up the above.
No encryption.
An £50 "fireproof" box for passports, citizenship certs, things where the original is important.
I assembled a ridiculous old server to do tape backups but couldn't make a very practical system - I wish I could attach an LTO drive to something smaller!
As for products, i have aimed to buy services that are easily transferrable, i.e. we use iCloud family sharing, meaning every user has their own storage, but they piggy back on my paid subscription. If/when i stop paying, they can chose to continue paying by themselves, and things will just "magically" keep working, but otherwise it's a tricky game.
As for memories, i store everything in the cloud. Most is encrypted by Cryptomator. I have a small server at home synchronizing data home in real time, and the server backs up data hourly to a local target, and a couple of times per day to another cloud.
I have thought long and hard about decentralizing this to let each users laptop do this, but with a family photo album of 3.5TB that would mean i lose deduplication.
It would also put much more demand on the client hardware, as it would need to be able to hold the entire photo library to back it up. I have LONG REQUESTED a way to backup iCloud data without the need to keep it all local, i.e. download upon use for backup, and delete again, but sadly that's not how it works. If your data is "cloud only", your backup will only backup file pointers, and not the actual photos.
Once every year i make Blu-Ray M-disc archives of the previous years photos, and to some lesser extent also documents. I make identical copies and store them in geographically separate locations. I use no encryption or compression on these archive discs. I also have a set of identical external USB drives that i update with the entire archive yearly and then rotate between locations.
I am fully aware that optical media may not be around for much longer, but that's a problem for future me. For now it is the best and most affordable "long term, no maintenance" storage technology available to consumers. Until an official "end of life" is reported, i will keep using it. Once it's EOL i can easily migrate my archive to "something more better", even if that ends up being just ordering photo copies.
I store everything in the cloud, because it is unbeatably convenient except for the very things mentioned: health, financial and legal. I'm not 100% strict either, because if a document has been sent or received electronically or can be assumed to be stored in the cloud anyways I really can't do much. I still assume that every online storage will be breached one day and keep some documents deliberately offline.
To give you some chills, check out "Who's been pwned" [1] and this is only the tip of the iceberg that has surfaced and that fulfills HIBP's inclusion criteria.
So, in the end, I just keep them in the Documents directory. Some, most commonly used are also in my email. That's it.
The fireproof bag also has my account/password list (printed out once a year), birth certificates, titles, and passports.
If you are the tech person in your family, think about what happens if you die. Will your family be able to figure out and maintain your system? Mine is far from perfect, but most of my family can figure out Google Drive.
This works better than I can possibly tell you.
I have an Epson WorkForce ES-580W that I bought when my mother passed away to bulk scan documents and it scans everything, double-sided if required, multi-page PDFs if required, at very high speed and uploads everything to OneDrive, at which point I drag and drop everything into Paperless.
I could, thinking about it, have the scanner email stuff to Paperless. Might investigate that today.
Paperless will OCR it and make it all searchable. This setup is amazing, I love living in the future.
We use SyncThing because it is free, doesn't require user accounts, and works across Mac, Windows, Linux, iOS (Möbius) and Android.
We have a bunch of high level folders (eg. Health, School, etc). Inside that folder every document is named starting with a date, eg: "2023-09-11 Invoice Dentist Kid1.pdf".
So far I've always found every document.
I rsync my picture and document library from my MBP to the NAS. The MBP also has a Backblaze backup. 2-4 times a year I sync my pics, docs, and backups to a 6TB drive that’s offline.
iPhones have very nice scanning tools built in, but it’s buried in the Notes app. Create a new note, click the camera icon, then “scan documents” and it will create a very nice, usually well cropped and OCRd scan that’s saved as a PDF you can then export elsewhere. Wish it was a standalone app because it works so well, and this is from someone who helps digitize and preserve paper documents for a living.
Most recent years letters are however photographed on a phone and put into relevant shared photoalbums, then shredded. Albums by year and category e.g. "School docs", "Kids School work", bills, statements, manuals and instructions, etc. Still have a big chore to go through all the old folders to scan and shred so we can really be paperless.
A few times I have had to dig out old statements or proof of address on a recent bill. However they all just wanted a photo of it, so storing it digitally would have been fine.
Important digital files are all on our family computer which is backed up to a cloud service which will mail me a disk to do a restore if needed. This includes all our photos and videos from since we've had digital cameras. If both my wife and I both die, no one will have any clue how to deal with this and that's OK.
Photos we really like are printed and in photo albums organized by date. We'll lose these in a house fire and that's a risk we're willing to accept. The ability to easily pull a physical album and see pictures from a past time grouped together and curated is something we really like.
Emails go into a plain git repo, and photos go into another git annex repo.
Ideally I’d like to put all of that into Perkeep because I like the idea of just throwing anonymous blobs into a bucket with the possibility to organize and tag them after the fact, but it looks like that project is dead and I would need to add a few things to it to support my use cases.
I store a lot of scans and emails I won’t ever need again. I just consider that storage becomes cheaper faster than my storage needs increase, and that storing extra documents and saving myself the need to decide whether it’s worth keeping, is a good tradeoff.
A) Mint for most financials. iCloud for storage of documents.
B) Apple Health for keeping track of my records. For ad-hoc stuff I use iCloud to store the documents.
C) Product and service info I use iCloud for. Logins and credentials to them are stored in my family's 1Password vault.
D) All legal documents stored on iCloud.
E) For photos and videos I use iCloud Shared Albums. It's unlimited free online storage of photos. Otherwise it's a mix of iCloud storage and the Notes app on Apple devices.
Obviously you can tell I'm deep in the iCloud ecosystem but personally I feel it's worth it. The synchronization and ease of use for my family is well worth it. For what it's worth I'm on the 200GB plan.
Everything else goes into Google drive. I use the Google drive app on Android to scan my physical documents, and I pay for the 200GB plan which is enough for all my stuff.
I've been hoping to write a script to backup stuff into an S3 bucket. I just never have the time.
Fast data:
- documents, pdfs, config files, notes txt etc.
- these files are synced directly, on-site and off-site
Slow data: - archives, images, music, large backups etc.
- these files are transfered to off-site archive on a per-need basis
- they are made available in the (family) cloud as read-only
- they are made available to selected internal services read-only (music service, photo service etc.)
Tech stack: - I self-host
- I follow best practices (zfs, 3-2-1 rule, vpn-only)
- currently I use Nextcloud, rsync and borgmatic
On a more serious note I use MS OneDrive family account and iCloud - I figure if either of those companies goes under there are bigger things to worry about. I print a lot of my photos so I’ve always got hard copies of the kids when they were growing up - everything else is unimportant or replaceable.
I have a filesystem tree that organizes digital copies of all financial documents/data.
The tree is routinely backed up to rugged USB flash drives, labelmakered "FOR HEIRS", and stored in my safe and bank safe deposit box. The drives are unencrypted, and in FAT32 format (though I use Linux), to increase the likelihood that whomever needs it can access it easily.
At the top of the tree is a one-page `READ-THIS-FIRST.md` (text file using CR-LF encoding, in case they view it on MS Windows). It's also printed as paper copies in the safes. This one-page document tells them about the "FOR HEIRS" USB flash drive, the safe deposit box, GnuCash, and various insurances.
Also in the safes is a one-page printout/PDF from GnuCash, which shows all my assets and liabilities, other than durable goods in the household. (I'm guessing those two one-pagers are most of what heirs would need, at least initially. If they need more later, there's the full GnuCash database, and the neatly named and organized PDF files of things like financial service statements.)
I've recently kludged an easy way to maintain household durable goods inventory, by kludging onto purchase transactions in GnuCash. I still have to find the time to encode all the cruft I've acquired in the past, though. If/once I do, a household inventory might become a third paper copy in the safes (for possible someday insurance or estate purposes).
The idea of trusting a single entity (the family computer expert, a single company, or a government agency) to be the sole host or custodian of important records and family photos/videos is fraught with risk: Even if that entity has redundancy across multiple geographic locations, they can have security breaches, their software can fail, their business can fail, or they can otherwise become incapacitated.
Security breach risk can be drastically reduced by encrypting your files before you even store them in that entity's system(s). But you'll need to protect that key and keep it in multiple locations that multiple people have access to. And it's going to make certain functions (like search indexing or file format conversions, to battle file format obsolescence) a lot more complicated.
Data loss risk due to failing hardware, software, companies, or people, can be greatly reduced by copying your stuff to multiple storage providers who ultimately rely on physically different data centers (e.g. if company A and B both ultimately use S3, Amazon is now my single point of failure), and taking an "append-only" approach.
But who is going to spend the time to pre-encrypt and sync all of your family's records like that? Possibly the family computer expert, who you can't depend on long-term.
If you're lucky, you have one who will do all of that for now, and maybe give everybody in the family a copy on a durable USB drive or similar* every few years. Or maybe you can pay someone to do the same. I'd pay for it, because it's a lot of work I don't want to do.
* For longer-term archival storage that should last several decades without power, M-Disc (DVDs or Blu-ray discs) used to be a go-to option. But that depends on people having access to a Blu-ray player decades from now, which I suspect will be a lot like the process of getting access to a floppy drive today.
I have been doing this for quite a while for our family, but I have extended slowly to immediate relatives while we helped them out with technology-related tasks. I haven’t reached the stage of complexities with the files and folder hierarchy that might break cloud providers[4].
Our family has a complete offline copy of everything, including backups of the photos, videos, and others. Everything is still in the works, and I recently started digitizing some old pictures from the in-laws. I stumbled upon some of their photos from around India’s independence and played with the colorization and animation thingy (that popped up a few months back). It became a craze, and their super large family gathered for a presentation, making many elders cry with tears of joy and disbelief.
My most significant achievements, compliments, and moments of pride have been that almost all of the new generations in the family choose to study computer engineering or computer-related. And so far, the best one is the niece commenting, “When I grow up, I want to marry a boy who is as technical as aunty’s husband.”
I have copies of the commercial ones (companies set up, wound down) since their inception. I also try to stick to plain-text[5], and as open formats as possible.
2. https://workspace.google.com
That’s it.
I have a fairly deep paper bin that I lay papers flat in. Any marginal paper documents or receipts that aren't obviously garbage and aren't obviously valuable go into this bin, always placed on the top.
Every 10-12 months I pick up the stack and remove, then shred, the bottom 1/4 of it - without review or remorse. If I didn't need them by then I never would.
I find that I actually use this buffer:
Every month or two there actually is some marginal paper document that I need to shuffle through and find. It's in reverse chronological order so that is very simple.
1. Beancount via org-mode notes, org-babel for transactions, bean queries from the org mode file for reporting, non perfect, a new thing, in the past I've used GNU Cash but I prefer to put anything in org so...
2-5. equally in org-mode org roam managed notes, non textual stuff as org-attachments linked directly in notes or indirectly "elisp:(start-process-shell-command" alike to run different stuff than mere opening.
I have to retain something for fiscal reasons, something for personal reasons and something are extras. It's not that perfectly crafted, but accessing anything by titles (org-roam-node-find) is like having anything in a graph, accessed via comfy search&narrow. When titles fails consult-rg on the org-mode root (~/notes) does the rest.
I've tries org-ql, to query notes for certain infos like "active contracts I have", but even with templating (yasnippet) it's not that immediate, I have some queries and templates but I do not really use if not very rarely. I've try LLama (Khoj) on my notes but well, results are very bad compared to direct textual mach access. So, yes, I have some clutter, I do not see it much, it does not consume significant amount of storage nor pollute my searches so... It's manageable, rarely I found something is the clutter, so it's not totally useless anyway. I am able to find anything noted so far, of course I do not log my life completely so sometimes I'm looking for things I do not have, that's rare but happen. In this cases the answer is "it depend".
I choose to live digitally in Emacs mostly because of integration: I have mails, todos, notes, anything in the same place/tool/model/paradigm like my mind is one for all.
Downsides: it's a personal thing, not really usable by other family members, not designed for collaborative works, it's possible works together but only as ISOLATED individual, all on their own desktop, possibly Emacs but not necessarily and definitively not "the same Emacs" sharing steps/results by various means like patches via mail, direct textual mails "I found this and that, I conclude that" etc.
In this file I write the date and a short note anytime something important happens. Visit the doctor and get test results? Write a note. Change the oil in my car? Write a note. Get a quote for a mortgage? Write a note. Open a new bank account? Write a note. Shop for new running shoes and learn what size fits best? Write a note. Have a contractor come work on the house? Write a note.
If there are important files that relate to those I list filenames and say where they are, but frankly I haven’t found that to be very useful. Rarely do I need any original documents. And if I died prematurely, my family can just scroll through this document to check off accounts (Google will automatically send it to them if I don’t access it for a period of time).
I’ve been using it for a few years and found it to be a total lifesaver. I frequently just need to quickly see how I did something, who I used, or his much something cost.
I store the live version on Google Drive so I can edit it anywhere, but you could host it locally.
Does anyone have access to an AI into which they could copy-paste this whole page and have it extract a ranked list of the products that people mentioned, perhaps along with links and counts of people who mentioned them?
The page size exceeds the limit of what free gpt3.5 will do - so, just asking before trying to do it chunk by chunk.