Stories
Slash Boxes
Comments

News for nerds, stuff that matters

Ask Dan Kusnetzky About Linux Server Counts

Posted by Roblimo on Thu Jun 14, 2001 11:00 AM
from the is-it-science-art-or-marketing? dept.
How many people run Linux as a desktop OS? How many servers run Linux? Is the Linux server market share 8.6 percent or 24 percent or somewhere in between? Dan Kusnetzky is a heavily quoted analyst at IDC who wrestles with questions like these for a living. This is your chance to find out how analysts come up with all those numbers -- and why analysts seem to disagree with each other so often. One question per post, please. We'll send 10 of the highest-moderated ones to Dan, and post his answers as soon as we get them back.
This discussion has been archived. No new comments can be posted.
Display Options Threshold:
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
(1) | 2
  • Possibly a stupid question... by Anonymous Coward (Score:1) Thursday June 14 2001, @08:04AM
  • Inquiring minds want to know... by Anonymous Coward (Score:1) Thursday June 14 2001, @07:38AM
  • Re:Factoring in the Captive Audience by Hall (Score:1) Thursday June 14 2001, @11:05AM
  • What about undercounting? by RelliK (Score:1) Thursday June 14 2001, @11:54AM
  • Confidence level by bstadil (Score:1) Thursday June 14 2001, @07:59AM
  • Re:What about the so-called "third world"? by Rotten (Score:1) Thursday June 14 2001, @09:29AM
  • How often do you backcheck by lazlo (Score:1) Thursday June 14 2001, @12:33PM
  • Re:Anagram by jimmyphysics (Score:1) Thursday June 14 2001, @07:55AM
  • Re:I just have to ask... by DGolden (Score:1) Thursday June 14 2001, @08:40AM
  • Follow The Money by Codeine (Score:1) Thursday June 14 2001, @10:39PM
  • Who? by ONOIML8 (Score:1) Thursday June 14 2001, @06:18PM
  • Re:*what* consititutes a Linux server? by Mojo Geek (Score:1) Thursday June 14 2001, @07:14PM
  • Re:Sold vs Installed by Doppleganger (Score:1) Thursday June 14 2001, @09:35AM
  • Re:*BSD is dying by Ded Bob (Score:1) Thursday June 14 2001, @09:21AM
  • Re:Question(ADDNUM) by bmo (Score:1) Saturday June 16 2001, @01:14PM
  • Great point. by dave-fu (Score:1) Thursday June 14 2001, @09:30AM
  • Most Important Question by zpengo (Score:1) Thursday June 14 2001, @07:05AM
  • don't do it by just 'hits' by jon_c (Score:1) Thursday June 14 2001, @11:21AM
  • Do analysts have too much influence? by bwoodring (Score:1) Thursday June 14 2001, @08:36AM
  • FTP downloads? by Lussarn (Score:1) Friday June 15 2001, @01:33AM
  • Re:Factoring in the Captive Audience by Caspuh (Score:1) Thursday June 14 2001, @10:34AM
  • Re:*what* consititutes a Linux server? by jgerman (Score:1) Thursday June 14 2001, @11:46AM
  • BEST FRONT by phrostie (Score:1) Thursday June 14 2001, @08:04AM
  • What's More Important? by north.coaster (Score:1) Thursday June 14 2001, @09:06AM
  • Re:Doesn't IDG host Linux world? by north.coaster (Score:1) Thursday June 14 2001, @09:13AM
  • Who uses this data? by north.coaster (Score:1) Thursday June 14 2001, @09:01AM
  • Statistical Seriousness by baldusi (Score:1) Thursday June 14 2001, @12:17PM
  • Response to "secondly" by Army No Va (Score:1) Friday June 15 2001, @03:43AM
  • Beyond the Distro and Back by PHanT0 (Score:1) Thursday June 14 2001, @08:04AM
  • Re:Sold vs Installed by grue23 (Score:1) Thursday June 14 2001, @08:12AM
  • Gartner define the parameters - up front by mushrooms (Score:1) Friday June 15 2001, @01:44AM
  • Server vs workstation by SnapperHead (Score:1) Thursday June 14 2001, @02:13PM
  • Re:We Need To Know... by tomknight (Score:1) Thursday June 14 2001, @07:38AM
  • Doesn't IDG host Linux world? by Billly Gates (Score:1) Thursday June 14 2001, @08:19AM
  • OS stats fishy... by jpellino (Score:1) Thursday June 14 2001, @11:46AM
  • ethical conscience? by irw (Score:1) Friday June 15 2001, @02:26AM
  • Numbers vs. What Works by bitva (Score:1) Thursday June 14 2001, @09:04AM
  • Re:Here's a simple one... by update() (Score:1) Thursday June 14 2001, @07:23AM
  • Which is it... by falzer (Score:1) Friday June 15 2001, @01:27AM
  • how many computers run Linux? by cabalamat2 (Score:1) Thursday June 14 2001, @03:45PM
  • Good Numbers or Bad Numbers? by chicagothad (Score:1) Thursday June 14 2001, @07:41AM
  • Impossible!!! by Aapje (Score:1) Thursday June 14 2001, @09:21AM
  • Study funding by Apreche (Score:1) Thursday June 14 2001, @07:12AM
  • Re:How do you discount overcounting? by rfsayre (Score:1) Thursday June 14 2001, @12:07PM
  • Re:Question(ADDNUM) by chris_mahan (Score:1) Thursday June 14 2001, @12:47PM
  • Electronic Unbiased Polls by Greenisus (Score:1) Thursday June 14 2001, @08:03AM
  • Re:Data origins by JohnSmith1138 (Score:1) Thursday June 14 2001, @08:48AM
  • Embedded by nate1138 (Score:1) Thursday June 14 2001, @07:22AM
  • What is Linux's market outlook? by Supa Mentat (Score:1) Thursday June 14 2001, @08:43AM
  • Re:A look at the numbers by jramos (Score:1) Thursday June 14 2001, @11:22AM
  • My question by kraf (Score:1) Thursday June 14 2001, @08:01AM
  • NMAP anyone? by rolex2600 (Score:1) Thursday June 14 2001, @08:08AM
  • Server vs. desktop? by SilentChris (Score:1) Thursday June 14 2001, @12:30PM
  • Re:I just have to ask... by werbz (Score:1) Sunday June 17 2001, @11:10AM
  • Re:I just have to ask... by werbz (Score:1) Sunday June 17 2001, @11:17AM
  • Re:Company size. by mhdawso (Score:1) Thursday June 14 2001, @11:28PM
  • Re:I just have to ask... by Alessandor (Score:1) Friday June 22 2001, @01:54AM
  • Re:why not just ask? by Anonymous Coward (Score:2) Thursday June 14 2001, @09:15AM
  • Blather... by sheldon (Score:2) Thursday June 14 2001, @12:20PM
  • Re:Funding by swb (Score:2) Thursday June 14 2001, @01:03PM
  • Re:Reproducibility by HiThere (Score:2) Thursday June 14 2001, @10:45AM
  • Re:*what* consititutes a Linux server? by HiThere (Score:2) Thursday June 14 2001, @10:48AM
  • Re:How do you discount overcounting? by HiThere (Score:2) Thursday June 14 2001, @11:04AM
  • Ages. by viper21 (Score:2) Thursday June 14 2001, @07:32AM
  • Re:I Have a Few.... by toofast (Score:2) Thursday June 14 2001, @08:05AM
  • Re:*what* consititutes a Linux server? by gmhowell (Score:2) Thursday June 14 2001, @08:34AM
  • What good are these numbers? by Enonu (Score:2) Thursday June 14 2001, @07:11AM
  • How are copies and discards counted? by Army No Va (Score:2) Thursday June 14 2001, @09:37AM
  • Loaded question by connorbd (Score:2) Thursday June 14 2001, @07:49AM
  • Better link for "Dan Kusnetzky" by dmccarty (Score:2) Thursday June 14 2001, @12:23PM
  • Factory installed Windows by rneches (Score:2) Thursday June 14 2001, @01:23PM
  • Re:Factory installed Windows by rneches (Score:2) Thursday June 14 2001, @05:28PM
  • Effectiveness of MS' Campaigns by Fatal0E (Score:2) Thursday June 14 2001, @09:18AM
  • Error in data collection by erroneus (Score:2) Thursday June 14 2001, @10:04AM
  • I Have a Few.... by CrazyLegs (Score:2) Thursday June 14 2001, @07:11AM
  • The Role of Analysts by salesgeek (Score:2) Thursday June 14 2001, @02:45PM
  • Re:Sold vs Installed by Ayende Rahien (Score:2) Thursday June 14 2001, @08:17AM
  • Sold vs Installed by Ayende Rahien (Score:2) Thursday June 14 2001, @07:47AM
  • Re:Impossible!!! by Waffle Iron (Score:2) Thursday June 14 2001, @11:05AM
  • Faster servers hurting themselves by Tachys (Score:2) Thursday June 14 2001, @05:57PM
  • Re:More breakdown needed by esnyder (Score:2) Thursday June 14 2001, @10:33AM
  • by Forge (2456) on Thursday June 14 2001, @07:37AM (#151222) Homepage Journal
    How did you go about researching your figures? Who was involved? How many people companies or vendors did you ask? What statistics did you track?

    In short how did you come up with your figures. We could ask about the other figures too but since you didn't publish those someone else may have to answer that.
  • by superid (46543) on Thursday June 14 2001, @07:19AM (#151223) Homepage
    How do you handle large blocks of users that are forced (often against their will and better judgement) to use a particular operating system? For example, 360,000 Navy users are forced [eds.com] to use Windows 2000.

    SuperID
    Free Database Hosting For Developers [freesql.org]

  • by ant-1 (120272) <ant-1NO@SPAMpouch.name> on Thursday June 14 2001, @07:30AM (#151224)
    During Linux servers market share study (or similar studies involving Linux), are you concerned about the fact that there's no big Linux company who will ever buy you such work, meanwhile Microsoft or other giants sure will ask again for it in the future ?
    Does it affects you or your team work ?
  • by YU Nicks NE Way (129084) on Thursday June 14 2001, @08:58AM (#151225)
    I've read your report, and I didn't find any description of the mechanism by which you came up with your estimate of the average number of times that a given purchased copy of Linux is installed. That's probably my own error, but since you're answering questions, would you be willing to enlighten me?

    You use the number "15". Frankly, I'm surprised that it is so big. Upon what data did you base that estimate? Who did you interview to get it? I realize that any such estimate would have to account both for the very large number of installs at large colocation and/or service provider shops, as well as the number of untrackable network installs that take place. But it would also need to account for the number of times where a machine had Linux installed upon it, was used as an experimental development platform, and then was wiped, not to mention the number of cases where somebody bought a distribution, and then never installed it at all.
  • by stox (131684) on Thursday June 14 2001, @07:45AM (#151226) Homepage
    I would like to see a breakdown of server O/S weighted by usage. For example, a server that receives 1000hits/hour would have 1000 times more weighting than a server which receives 1 hit/hour. Such a weighting would give a much more honest view of usage. In other words, which server O/S is getting the most work done. I suspect this would lead to significantly different results from what we have seen so far.
  • linux ports. (Score:3)

    by saintlupus (227599) on Thursday June 14 2001, @08:59AM (#151227) Homepage

    Many Linux desktops have ftp, telnet & http ports open, so do they count as severs too?

    no, they count as targets for l33t haX0rs, especially since the people who just installed mandrake from that wal mart cd don't even know they're open or use them...

    --saint
    ----
  • by motorsabbath (243336) on Thursday June 14 2001, @07:36AM (#151228) Homepage
    When most desktop and server Linux users are building their own boxes, how can sales of server and pc systems be relevant? Most people I know that use Linux (myself excluded) first bought a PC with wDOS on it, removed that OS and installed something else. What kind of formula do you use to make up for this disparity between systems sold and systems built ?
  • Question (Score:4)

    by HeUnique (187) <hetz-home@cobol2jav[ ]om ['a.c' in gap]> on Thursday June 14 2001, @10:17AM (#151229) Homepage
    Hi Dan,

    According to IDC figures, the Linux desktop market share was 5% and now it's less then 2%. Gartner numbers are of course differently (way lower if I'm not mistaken)

    Yet, when I look at developments of projects like XFree, KDE, Gnome, Linux kernel - or when you get the daily list from freshmeat, or even talking to the the ISP who host mirrors of ISO images of Redhat, Mandrake etc - then you see that linux get FAR more then 2%. Hell - if it was 2% and you account the developments of Linux - then each developer works 25 hours per day on a porject!

    So, as you can see - the numbers here are definately wrong here - and those numbers are actually hurting the Linux community. If an ABCD company wants to make a software for the Linux desktop and they see those IDC figures - then they will say something like "oh, 2%? no thanks - we'll make it for Mac - they are %5+", and we'll loose..

    Comments?
  • Company size. (Score:4)

    by viper21 (16860) <scott@iqfoundr y . c om> on Thursday June 14 2001, @07:36AM (#151230) Homepage
    Is there a relationship between company size and linux use?

    Do we have evidence of any companies with multi-million dollar revenues that rely on linux solutions for their servers or do most companies that use linux servers do it because they lack the money? (choice/only option)

    -S

    Scott Ruttencutter
  • by thewiz (24994) on Thursday June 14 2001, @07:47AM (#151231)
    Dan,
    How does IDC determine if the answers they are getting are valid? What I mean by this is how does your organization insure that the people they are surveying being truthful in their responses? I can see scenarios where a well co-ordinated group of people could be swayed by a vendor to influence the results of a survey, especially in a field as specialized as the IT industry.
    Also, if you suspect that the results of a survey are tainted, do you publish it anyway (with caveats included, of course) or redo it (at your time and expense)?
  • Re:Funding (Score:4)

    by mpe (36238) on Thursday June 14 2001, @07:40AM (#151232)
    In studies of market share(or studies in general), we often hear quotes about who funded them. This seems to somehow imply that those who funded the study had some influence in how the data was gathered/interpreted.

    Even if the funder does not infulence the questions asked they may well control if the "results" see the light of day.
  • by mpe (36238) on Thursday June 14 2001, @07:46AM (#151233)
    I have often wondered how biased polls are based on the questions asked, the demographics of the people polled, etc. When results about polls are made public, is it also possible to obtain information about how the poll was conducted in a simple, by request method?

    It's quite trivial for methodology to affect the results. That is why in any kind of scientific publication the method is considered at least as important as any conclusion.
    Not only is no methodology available "conclusions" are being called "results".
  • by adubey (82183) on Thursday June 14 2001, @08:00AM (#151234)
    Hi Dan,

    This is probably going to be a tough question for you (if you ever get it, that is - given the bias on Slashdot, this probably won't be modded up).

    A big problem I see with your methodology is that you probably overcount Linux server shipments. From what I understand (I may be incorrect here) you count each sale of Linux as a server shipment. However, many of these copies may be tested but never used, a great many are used in home computers or development workstations. The problem is, unlike with Windows or OS/2, there is no ``client version'' vs. a ``server version''. There is no easy way to tell if a copy of Linux is used in a server environment or a client environment by sales figures alone. So my question is: given total Linux sales figures, how do you estimate the number used in server environments?

    Secondly, is it possible that the descrepancy between your numbers and Gartners' is due to a problem in the way you estimate Linux server sales?

  • by cheezus (95036) on Thursday June 14 2001, @07:42AM (#151235) Homepage
    What is the real world impact of these numbers? Is it just an ego boost for the producer of product X? Does it influence buying? Does it change R&D strategy? Especially since the numbers differ so much from analasyst to analasyst, how does anyone make any sort of decisions bade on this information?

    ---

  • why not just ask? (Score:4)

    by peccary (161168) on Thursday June 14 2001, @07:12AM (#151236)
    The technical approach of talking to publicly accessible servers and attempting to fingerprint the OS is fraught with methodological problems.

    The naive approach of asking hardware vendors how many units of hardware ship with what OS is obviously flawed.

    So why not just do the dumb brute-force thing? Dial phone numbers at random and ask people what they use? Too expensive?

    How much is accurate data worth, anyway?
  • Trends (Score:4)

    by s20451 (410424) on Thursday June 14 2001, @07:15AM (#151237) Journal
    Do you have any insight about trends that can be gleaned from these surveys - that is, regardless of what number one uses for the percentage of users that adopt Linux, is it static, taking off, steadily increasing, or what?
  • by Anonymous Coward on Thursday June 14 2001, @07:15AM (#151238)
    At what point does a computer become a server? Many Linux desktops have ftp, telnet & http ports open, so do they count as severs too?
  • by HeUnique (187) <hetz-home@cobol2jav[ ]om ['a.c' in gap]> on Thursday June 14 2001, @10:27AM (#151239) Homepage
    I just remembered an Idea that a friend of mine suggested:

    What if IDC could work with the Linux distributions (RedHat, Mandrake, SuSE, TurboLinux, Debian) to add a small program which will run after the first internet connection has been succsessfull..

    When this program runs - it will ask the user to "register" his copy of the distribution. If it has been purchased from one of the distributors - then the user can add his serial number. Some other questions like will this distribution be used as a server, a workstation, or combination of the 2, or a development workstation. The survey SHOULD be annonymous (unless the user wants to give some details about himself)

    By that way - the distributors can give the numbers back to IDC - and IDC can publish a report which will tell that the number of Linux installations - and that number is X. X is combined of Y free download version and Z purchased copies of Linux.

    What do you think, Dan? what the slashdot readers think about it?
  • Data origins (Score:5)

    by m2 (5408) <ib9a2f46001@sneakemail.com> on Thursday June 14 2001, @07:14AM (#151240) Journal

    Do you base your data mostly on marketing analysis or do you actually go a pay a consultor to scan machines on the net? If there are scans involved, how do you pick the IP blocks to be scanned and what's the uncertainty associated with such a method (and how is this uncertainty guessed)? If there are no scans involved, why not? If this is "maket analysis", can you defined that for me? Which factors are involved? And a different question: who's the target market for this kind of study? How much does such a thing cost?

  • Reproducibility (Score:5)

    by DragonWyatt (62035) on Thursday June 14 2001, @07:26AM (#151241) Homepage
    Has any thought been put into doing a "reproducibilty challenge" against Gartner? That is,
    1. Ask Gartner their methodology,
    2. Document it so that it can be reproduced;
    3. Make sure and get Gartner to say "yep that's how we did it".
    4. Dan&Co reproduces the methodology and compares the numbers
    Might be overly scientific. Maybe consider it an "Open Source" version of market research because of the peer review and verification. Some may think this approach has no place in the market research area.

    But, I (for one) think it would be interesting to call Gartner's Bluff (if indeed that's what this is). I personally place them (and ZD, etc) into the Shill [tuxedo.org] category.

    Thoughts anyone?
  • by Ded Bob (67043) on Thursday June 14 2001, @07:04AM (#151242) Homepage
    Might he have the stats on the BSD's. People from the BSD community are curious. :)

    TIA
  • by dvk (118711) on Thursday June 14 2001, @07:13AM (#151243) Homepage
    Do you think that there exists a possibility (or can even provide examples of) self-fullfilling analysis, such as "analysis says X is losing market share=>people get skeptical about X=>X loses market share although it may not have done so otherwise"?

    If it is possible or already happened, do analysts in general (and you in particular) find it a worrisom possibility, and if so, are there any attempts/ideas to deal with the issue?

    Thanks,
    DVK

  • Funding (Score:5)

    by ritlane (147638) on Thursday June 14 2001, @07:08AM (#151244) Homepage
    In studies of market share(or studies in general), we often hear quotes about who funded them. This seems to somehow imply that those who funded the study had some influence in how the data was gathered/interpreted.

    My question is: Do those who fund a study influence how the study turns out (ie. Microsoft studies show higher MS market share). Or is it that these corperations only decide to fund groups who they know will most likely return results in thier favor.


    ---Lane
  • by dmccarty (152630) on Thursday June 14 2001, @08:01AM (#151245)
    Mr. Kusnetzky,

    I have a question on the area of predictions in general. For example, fellow IDC analyst Jill House has been severaly negative on Palm over the years, with regard to the Win CE operating system and devices. A sample quote from her in Feb. 2000 read, "If I was Palm, I would be beside myself with panic. [zdnet.com]"

    The issue is, that over the last 3 - 4 years she's been predicting the demise of Palm and the rise of Win CE, a claim that has never materialized. Who verifies the reliability of these predictions and keeps the analysts accountable. With the frequent sound bytes and one-liners that they give to the press, these analysts have significant influence over public perception of the issues. But how is policing done when the analysts don't analyze very well?

    Sincerely,
    Daniel McCarty
    Palm OS Developer

  • by Kareem Abdul-Lamarr (178445) on Thursday June 14 2001, @07:42AM (#151246)
    Do these analyses factor in the so-called third world? Most of these analyses are US-centric or some times do include the continent across the pond but what about Africa and Asia? Do these analyses *really* take inputs from these continents?
  • by RareHeintz (244414) on Thursday June 14 2001, @07:04AM (#151247) Homepage Journal
    Do you think Gartner Group caved in ethically when Microsoft paid them for this study? That is, do you believe it to be a purchased fabrication? Or might it be an honest mistake, a difference in data analysis, or something less sinister?

    Thanks for your time,
    - Brad Heintz
    --

  • Polling questions (Score:5)

    by cavemanf16 (303184) <cavemanf16&gmail,com> on Thursday June 14 2001, @07:30AM (#151248) Homepage Journal
    I have often wondered how biased polls are based on the questions asked, the demographics of the people polled, etc. When results about polls are made public, is it also possible to obtain information about how the poll was conducted in a simple, by request method? Now if the answer to that question is, no, how much can we rely on polls, since we have no way of verifying if the questions asked and the people interviewed were heavily biased to favor one outcome over another? (Such as in the recent large discrepancies of the 8% vs. 24% use of Linux as a server results that we've seen on Slashdot recently).
  • by BillyGoatThree (324006) on Thursday June 14 2001, @07:09AM (#151249)
    When People magazine does an issue devoted to "what's hot" in fashion, do they interview Jane Doe from Des Moines, Iowa? No.

    So why are OS numbers reported with equal rating? Not all users are equally suited to *choose* an OS, therefore not all users *choices* are equally interesting. I'd really like to see a breakdown of OS by user-type (levels of education, field of degree if applicable, occupation, etc). Keep in mind this applies just as much to business. A technology company presumably put more informed thought into their choice of server than an art supply house or whatever.
    --
(1) | 2