Hi everyone,

I’m a PhD student in Computer Science researching why people choose to self-host software—what motivates you, what concerns you, and what factors affect your decision-making.

To better understand this, I’ve prepared a short anonymous survey (~10 minutes). Your insights as part of the self-hosting community would be incredibly valuable for this research.

🔗 Survey link: https://survey.lpt.feri.um.si/376953?newtest=Y&lang=en&s=ls

This study is part of my doctoral research at the University of Maribor, Slovenia, conducted under the supervision of Assist. Prof. Lili Nemec Zlatolas, PhD. All responses are anonymous and used strictly for academic purposes.

If you’ve ever self-hosted anything—or even just considered it—I’d really appreciate your input.

Thanks a lot for your time, and feel free to ask me anything about the project (luka.hrgarek@um.si)!

Cheers!

  • skisnow@lemmy.ca
    link
    fedilink
    English
    arrow-up
    22
    ·
    7 days ago

    I’m old enough to consider the framing of the question to be weirdly loaded.

    It does not feel that long ago where people would be asked to justify entrusting their product’s functions and data to a bunch of strangers who can make unilateral decisions about your service with zero comeback. Now we’re being asked to justify not doing that.

    • SelfhostedResearch@lemmy.worldOP
      link
      fedilink
      English
      arrow-up
      1
      ·
      6 days ago

      Thank you for your comment. The use of similar statements is a common practice in survey research, as it helps to capture various dimensions of a construct more reliably and provides a clearer understanding of individual perspectives.

      Regarding your concern, the purpose of this study is not to ask anyone to justify or defend their choices, whether it’s about using third-party services or self-hosting. Instead, we aim to identify the factors that influence such decisions, from a scientific standpoint, to better understand the motivations behind them. The goal is not to judge whether one choice is better than another, but to gain insights into the different considerations that shape people’s decisions when it comes to managing their data and services. Thank you again for taking the time to complete the survey.

      • skisnow@lemmy.ca
        link
        fedilink
        English
        arrow-up
        1
        ·
        6 days ago

        Sure, I’m just bemoaning the fact that you’ve taken cloud hosting to be the default. It’s as much a complaint about the world in general as anything specific to you. Good luck with it all.

        • SelfhostedResearch@lemmy.worldOP
          link
          fedilink
          English
          arrow-up
          2
          ·
          6 days ago

          Totally understand your concern, and you’re right, the assumption of cloud as a default can be frustrating in many ways.

          That said, this framing partly reflects the state of the academic literature: in the past 10–15 years, cloud adoption (especially SaaS) has been extensively studied, to the point where it often feels “default” in research too. In contrast, self-hosting has been far less explored, which is exactly why we’re doing this study—to help fill that gap and highlight its relevance, especially in academic contexts.

          Thanks again for your thoughts and for the good wishes! :)

  • sugar_in_your_tea@sh.itjust.works
    link
    fedilink
    English
    arrow-up
    8
    ·
    7 days ago

    I was kind of surprised by my answers when I stopped to reflect. I realized I:

    • don’t really like self-hosting
    • know a lot about new tech, but am not very excited about it
    • don’t use a lot of the popular services

    Anyway, I hope the results are useful! I don’t know if you’ve done it already, but it would be interesting to compare results from different sources, like Lemmy vs Reddit or wherever you posted it.

  • Crash@lemmy.ml
    link
    fedilink
    English
    arrow-up
    6
    ·
    7 days ago

    I had a hard time answering these because my opinion on cloud service depends on the cloud. (Google vs nextcloud for instance)

  • AnimalsDream@slrpnk.net
    link
    fedilink
    English
    arrow-up
    5
    ·
    7 days ago

    The thing I don’t get about these self-host apps is why so many of them exist when the thing they do would be better to implement as a run of the mill offline program.

    I just want to auto-import recipes from websites into a cookbook app without any fuss. We do not need to bring a server into this equation!

    • InnerScientist@lemmy.world
      link
      fedilink
      English
      arrow-up
      7
      ·
      7 days ago

      When making an application instead of coding for one platform you have to code for 5 and also convince Apple and Google to accept your app (Nextcloud is really feeling this one).

      Meanwhile HTML + JavaScript works on most smart fridges.

  • bitwolf@sh.itjust.works
    link
    fedilink
    English
    arrow-up
    5
    ·
    edit-2
    7 days ago

    Done.

    I feel that it may be helpful to try to capture the motivation for each type of service.

    For example. The reasons I host a media server are different than the reasons I host a photo backup solution.

    Thank you for doing this research. I dream of the day that self hosting becomes as easy as spinning up a consumer router.

    • SelfhostedResearch@lemmy.worldOP
      link
      fedilink
      English
      arrow-up
      4
      ·
      6 days ago

      Thank you very much – I really appreciate your participation! Yes, the results will be published as part of my PhD dissertation, and also in one of the peer-reviewed journals in the field of Computer Science. Once everything is finalized and publicly available, I’ll definitely share a summary and a link to the publication here as well. Thanks again for your interest and support!

  • raldone01@lemmy.world
    link
    fedilink
    English
    arrow-up
    3
    ·
    7 days ago

    Done but I felt lots of questions to be very similar. Maybe there is a form platform that can show only a subset of control questions for every survey.

    • SelfhostedResearch@lemmy.worldOP
      link
      fedilink
      English
      arrow-up
      9
      ·
      7 days ago

      Thank you for completing the survey and for your thoughtful feedback. The similarity between some questions is intentional and follows common scientific practice when measuring complex or abstract concepts. Using multiple, slightly varied items that target the same construct increases the reliability and validity of the data by capturing subtle nuances and reducing the influence of random response variation. While your suggestion to show only a subset of such items through adaptive platforms is valid and worth exploring, fixed item sets are generally preferred in research settings to ensure consistent and robust measurement. We appreciate your input and will consider it in future survey design improvements.

  • one_knight_scripting@lemmy.world
    link
    fedilink
    English
    arrow-up
    3
    arrow-down
    1
    ·
    6 days ago

    Yeah, those data questions are really loaded. I don’t host for privacy or what not. It’s because of a learning objective, to study, experiment, and run automated stock trading algorithms. I don’t exactly have anything to hide from private companies.

    • SelfhostedResearch@lemmy.worldOP
      link
      fedilink
      English
      arrow-up
      1
      ·
      6 days ago

      Thank you for your comment. The use of similar statements is a common practice in this type of research, as it helps to better capture different aspects of a construct and ensures reliability. I understand that privacy may not be your personal motivation for self-hosting, and that’s perfectly fine. The purpose of this survey is to explore a variety of factors that can influence why individuals choose to self-host, and to determine the relative importance of each. Even if certain factors don’t apply to you, your responses contribute to a broader understanding of the motivations behind self-hosting. Thank you again for taking the time to complete the survey.

  • daytonah@lemmy.ml
    link
    fedilink
    English
    arrow-up
    1
    ·
    7 days ago

    Submitted.

    I am in finance/accounting/consulting and I run one self hosted small server at home to manage my home machines remotely and use it with combo of proton to do most things.

    While I am in the process of de-googleing I haven’t self hosted photo or email yet as proton has been working okay for the time being. One step at a time I guess for me… I am not in IT so it requires time for me to read up on things and set them up (rtfm indeed)…(yes we all know what proton CEO said… )…let’s see how it pans out.and this has been quite a journey for me. I am still having issues with family and friends not wanting to use jitsi (that I am using for calls or contact me on signal). I have basically thrown WhatsApp and anything I don’t care about within beeper under a work profile (android) I.e. in a sandbox / appimage (laptop).

    Personally if next cloud allowed self hosting with self email hosting (whatever the technical terminology is) that would have been (chef’s kiss) muah…I would have jumped on it first hand…

    Sorry I am not in IT so my jargon usage might be sub par.

    • kuhli@lemm.ee
      link
      fedilink
      English
      arrow-up
      4
      ·
      7 days ago

      Honestly, email is the one thing I wouldn’t self host, it’s just so much hastle getting it set up right and can be incredibly difficult to have emails you send not end up in spam. Just not worth the hastle imo.

  • starshipwinepineapple@programming.dev
    link
    fedilink
    English
    arrow-up
    100
    ·
    edit-2
    8 days ago

    I submitted a response but if i may give some feedback, the second portion brings up:

    I am willing to pay a substantial amount for hardware required for self-hosting.

    This seemed out of place because there were no other value related questions (iirc). Such as:

    • I believe self hosting saves me money in the short term
    • i believe self hosting saves me money in the long run

    I’m sure you could also think of more. But i think it’s pretty important because between cloud service providers and any non-free apps you want to use, it can be quite costly compared to the cost of some hardware and time it takes to set things up.

    The rest of my responses don’t change but if you’re wanting to understand the impact of money in all of this, i think some more questions are needed

    Best of luck!

    • AbnormalHumanBeing@lemmy.abnormalbeings.space
      link
      fedilink
      English
      arrow-up
      51
      ·
      edit-2
      8 days ago

      I believe self hosting saves me money in the short term
      i believe self hosting saves me money in the long run

      I can add to the voices here that have this as one big consideration. With some second-hand hardware, it’s very cheap to set up almost unlimited cloud space for personal use.

      • Sixty@sh.itjust.works
        link
        fedilink
        English
        arrow-up
        13
        ·
        edit-2
        8 days ago

        That, and a lot of questions about ease of use too, but I answered them neutral because some are bears to set up, others are one click. Idk it depends.

      • ulterno@programming.dev
        link
        fedilink
        English
        arrow-up
        3
        ·
        8 days ago

        And unless you are expecting significant traffic, you can use an old Core2Quad with 2GB RAM and it will work just fine.

    • modus@lemmy.world
      link
      fedilink
      English
      arrow-up
      12
      ·
      8 days ago

      Not to mention that a lot of self-hosting can be done on hardware you already had laying around.

    • rekabis@lemmy.ca
      link
      fedilink
      English
      arrow-up
      8
      ·
      8 days ago

      And I self-host precisely because of the money I save using surplussed hardware. I have a symmetrical 1Gb SOHO fibre connection from my ISP, so I can host whatever the hell I want, I just need to stand it up. And a beefy older system with oodles of RAM is perfect for spinning up VMs of various platforms for various tasks. This saves me craploads of money over even a single VM on cloud platforms like Vultr. Plus, even if I were to support a “heavy” service sufficiently in demand to warrant its own iron, it still costs me less than a year’s worth of hosting to obtain a decent platform for that service to run on all by it’s lonesome.

      My only cloud costs end up being those services which are distributed for redundancy and geographical distance, such as DNS and caching CDNs.

    • modifier@lemmy.ca
      link
      fedilink
      English
      arrow-up
      8
      ·
      8 days ago

      My only quibble would be to swap “pay” for “invest” which captures both the dynamic of up front expense and expected savings from ending recurring subscription fees. That’s how I look at it. Every penny I put into my own digital sovereignty is an investment that will yield returns both financial and otherwise.

    • gedaliyah@lemmy.world
      link
      fedilink
      English
      arrow-up
      4
      arrow-down
      1
      ·
      8 days ago

      Second this - so far it has cost me money, but as I am able to cancel more subscription services, the savings will add up.

  • Voroxpete@sh.itjust.works
    link
    fedilink
    English
    arrow-up
    32
    ·
    8 days ago

    People who influence my behavior think that I should use cloud services.

    This question is going to get bad data. No one likes to think of themselves as being influenced. A more effective phrasing would be “…people I trust…”

    • SelfhostedResearch@lemmy.worldOP
      link
      fedilink
      English
      arrow-up
      28
      ·
      8 days ago

      Thanks for the comment — that’s a valid observation, and I understand how the wording might feel a bit awkward.

      Just to clarify: the statement comes from a standardized construct called Subjective Norms, and follows the phrasing from the paper “A Theoretical Extension of the Technology Acceptance Model” by Venkatesh & Davis (2000).

      For all independent variables in the survey, we relied on validated scales and established practices from prior scientific research, to ensure consistency and reliability. That said, I really appreciate your feedback. :)

  • anarcho_vroom@lemmy.dbzer0.com
    link
    fedilink
    English
    arrow-up
    31
    ·
    8 days ago

    I’m a little concerned about selection bias (because obviously).

    I also want to know about people who are not aware of self-hosting. If they’d be interested or even try.

    • SelfhostedResearch@lemmy.worldOP
      link
      fedilink
      English
      arrow-up
      29
      ·
      8 days ago

      That’s a very valid concern, and you’re absolutely right to bring it up.

      One existing study that surveyed the general population found that about 8.4% of respondents were self-hosting users, which means that in order to get enough self-hosters from the general population for meaningful analysis, we’d need a very large sample.

      Unfortunately, we don’t have the funding or resources to conduct such large-scale research through a representative panel or agency. That’s why this study is focusing on communities where self-hosting is already discussed, like this one.

      That said, we’re definitely aware of this limitation, and we’re also sharing the survey in broader, more general-interest online communities where we expect non-self-hosters (or people unfamiliar with the concept) to be more present. This will allow us to include comparisons between the two groups in the analysis.

      Really appreciate your thoughtful comment — thanks!

      • grue@lemmy.world
        link
        fedilink
        English
        arrow-up
        20
        ·
        8 days ago

        One existing study that surveyed the general population found that about 8.4% of respondents were self-hosting users

        Wow! That’s a lot higher than I would’ve expected. My guess would’ve been about 1%, or maybe even an order of magnitude or so less than that.

          • kernelle@0d.gs
            link
            fedilink
            English
            arrow-up
            5
            ·
            8 days ago

            Thanks for the source, super interesting read! I would’ve guessed 1-5% as well.

          • Mniot@programming.dev
            link
            fedilink
            English
            arrow-up
            3
            ·
            8 days ago

            Thanks for linking that. Reading the paper, it looks like the majority of the “self-host” population they’re capturing is people who have a WordPress site. By my reading, the wording of the paper would disqualify a wordpress.com-hosted site as “self-hosted”. But I’d be very suspicious of their methodology and would expect that quite a few people who use WP-hosted reported as self-hosted because the language is pretty confusing.

        • mic_check_one_two@lemmy.dbzer0.com
          link
          fedilink
          English
          arrow-up
          7
          ·
          edit-2
          8 days ago

          My guess is that it also included things like the 12 year old hosting a Minecraft server for their friends. Which, to be clear, is a totally valid self-hosting use case.

        • nfh@lemmy.world
          link
          fedilink
          English
          arrow-up
          6
          ·
          8 days ago

          I suspect there’s a tendency of experts in something to think of people who do it narrowly as people doing at least as much as they are.

          The people who have a bunch of docker services, or complex multi-machine infrastructure are self-hosted software users, and probably in that 1-2% range. People who heard piholes are useful, so they bought a pi 3 and set it up are self-hosted software users. Somebody using an old desktop they got on Facebook marketplace for running Plex media are self-hosted software users… and so on. So are the people in their houses, some of their friends and family.

          Using that inclusive definition, being closer to 10% than 1% makes sense to me.

    • Macallan@lemmy.world
      link
      fedilink
      English
      arrow-up
      3
      ·
      8 days ago

      I sort of fit that category. I am aware of self-hosting, even somewhat interested. But I know absolutely nothing about it, and if I’m being honest, too lazy to research it.

      Truthfully, I haven’t owned my own PC/Laptop in over a decade. I just use the one I get from work if I need to do something on a computer. I preferred gaming on a PS4/5 so I could just relax on the couch with a controller instead of sitting in a chair at a desk. I recently got a steam deck and love it. I want to poke around desktop mode some more so I can get more familiar with Linux.

  • rtxn@lemmy.world
    link
    fedilink
    English
    arrow-up
    22
    ·
    edit-2
    8 days ago

    I use self-hosted services in the following categories as much as possible…

    That question could really use a “not applicable” option. I don’t operate any home automation solutions, so any answer from me would be invalid, and neutral answers because the item is not relevant will appear the same as neutral answers because I use both self-hosted and externally hosted solutions (e.g. Mullvad for privacy and Tailscale to get around CGNAT).

    • SelfhostedResearch@lemmy.worldOP
      link
      fedilink
      English
      arrow-up
      8
      ·
      8 days ago

      Thanks for the comment: that’s a really good point to raise.

      Just to clarify: the statement “I use self-hosted services in the following categories as much as possible” is meant to reflect how fully you make use of self-hosted solutions in each area. A response like “Strongly agree” would indicate that you actively use and take full advantage of self-hosting in that category.

      If you don’t use solutions in a particular category at all — whether that’s because you don’t need them, aren’t interested, or use only external services — then it’s completely appropriate to select a disagreeing option (e.g. “Disagree” or “Strongly disagree”). In this context, lower agreement simply indicates low or no use, regardless of the reason.

      From a methodological standpoint, the data will be analyzed using structural equation modeling (SEM). This approach requires a complete set of responses across the measured constructs. If we included a “not applicable” option, it would create missing values in the dataset and potentially lead to excluding the entire response for that part of the analysis — which would significantly reduce the usable sample size.

      That said, I really appreciate your feedback! :)

      • grue@lemmy.world
        link
        fedilink
        English
        arrow-up
        8
        ·
        8 days ago

        Be prepared for some respondents to choose the middle option as a proxy for “not applicable,” because that’s what I did.

        • chunkystyles@sopuli.xyz
          link
          fedilink
          English
          arrow-up
          3
          ·
          edit-2
          8 days ago

          I chose the middle option for things I’m not hosting, but could see myself hosting in the future.

      • curve_empty_buzz@discuss.tchncs.de
        link
        fedilink
        English
        arrow-up
        4
        ·
        8 days ago

        I get why you’re taking that approach but you risk serious misclassification bias. The replies have stated people are using both “disagree” and “neither agree nor disagree” to indicate they are not hosting a particular kind of service. From your description of your research it sounds like disagree and strongly disagree should indicate that the individual uses company hosted services instead of self hosted services for those domains. The relationship between views on privacy and types of services self hosted is going to be confounded by that.

      • towerful@programming.dev
        link
        fedilink
        English
        arrow-up
        3
        ·
        8 days ago

        Yeh, I took “don’t agree or disagree” to be the N/A.
        It seemed the most neutral.
        I don’t really use anything for bookmark sharing/management. So I don’t strongly disagree or strongly agree with self hosting it.

      • TheBeesKnees@lemmy.sdf.org
        link
        fedilink
        English
        arrow-up
        2
        ·
        8 days ago

        If this was the expectation, then it should have been a checklist and/or had N/A available. I don’t think your data for this section will be accurate since myself/others replying did not use it this way.

        I used “disagree” as “I am using a non-self hosted service for this,” the middle as “N/A” and the agree as “I am hosting a service for this”…