New OS X uses Windows file sharing by default

New OS X uses Windows file sharing by default (arstechnica.com)

178 points by nkhumphreys 13 years ago | 57 comments

rogerbinns 13 years ago |

SMB has an extension mechanism, and SMB 1 has had support for Unix extensions for over 15 years - I was the author of the original Unix extensions spec. You can get full Unix semantics using them (links etc).

The predominant form of extension is an "info level". Somewhat analogous to a data structure like that returned from stat, the numeric info level controls what structure is returned (or supplied). Microsoft had a tendency to add new info levels that correspond to whatever the in-kernel data structures were in a particular release rather than longer term good design.
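To make the "info level" idea concrete, here is a minimal sketch of a numeric level selecting which structure gets packed or unpacked. The level numbers, field names and layouts below are invented for illustration; they are not the real SMB info levels or wire formats.

```python
import struct

# Hypothetical info levels: level number -> (struct layout, field names).
# These are NOT real SMB level numbers or structures.
INFO_LEVELS = {
    1: ("<IQ", ("attributes", "size")),                  # basic info
    2: ("<IQII", ("attributes", "size", "uid", "gid")),  # Unix-style extension
}

def pack_info(level, **fields):
    """Serialise the fields that the given info level defines."""
    fmt, names = INFO_LEVELS[level]
    return struct.pack(fmt, *(fields[name] for name in names))

def unpack_info(level, payload):
    """Decode a payload according to the info level the caller asked for."""
    fmt, names = INFO_LEVELS[level]
    return dict(zip(names, struct.unpack(fmt, payload)))

payload = pack_info(2, attributes=0, size=1234, uid=1000, gid=100)
info = unpack_info(2, payload)
```

Adding a new kind of metadata then means adding a new level rather than changing an existing structure, which is roughly why levels tend to accumulate over releases.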

The general chattiness comes from their terrible clients like Windows Explorer (akin to Finder for Mac folk). I once did a test opening a zip file using Explorer. If you hand-crafted the requests there would be 5 of them - open the file, get the size, read the zip directory from the end of the file, close it. Windows XP sent 1,500 requests and waited synchronously for each one to finish. Windows Vista sent 3,000 but the majority were asynchronous so the total elapsed time was similar.
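As a rough sketch of the hand-crafted approach, the following reads a zip listing with one size query and two reads: one for the tail of the file, where the end-of-central-directory record lives, and one for the central directory itself. It runs against a local file object rather than SMB, and it assumes a plain single-disk archive with no zip64 extensions.

```python
import io
import struct
import zipfile

EOCD_SIG = b"PK\x05\x06"  # end-of-central-directory signature
EOCD_MIN = 22             # size of the fixed part of that record
MAX_COMMENT = 65535       # largest possible trailing archive comment

def read_zip_directory(f):
    """Return (entry_count, raw central directory bytes) using two reads."""
    f.seek(0, io.SEEK_END)
    size = f.tell()                          # get the size
    tail_len = min(size, EOCD_MIN + MAX_COMMENT)
    f.seek(size - tail_len)
    tail = f.read(tail_len)                  # read 1: the tail of the file
    pos = tail.rfind(EOCD_SIG)
    if pos < 0:
        raise ValueError("end-of-central-directory record not found")
    # signature, disk, directory disk, entries on disk, total entries,
    # directory size, directory offset, comment length
    fields = struct.unpack("<4sHHHHIIH", tail[pos:pos + EOCD_MIN])
    total, cd_size, cd_offset = fields[4], fields[5], fields[6]
    f.seek(cd_offset)
    directory = f.read(cd_size)              # read 2: the central directory
    return total, directory

# Usage: build a small archive in memory and list it.
buf = io.BytesIO()
with zipfile.ZipFile(buf, "w") as z:
    for name in ("a.txt", "b.txt", "c.txt"):
        z.writestr(name, "hello")
count, directory = read_zip_directory(buf)
print(count, len(directory))
```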

I worked on WAN accelerators for a while where you can cache, read ahead and write behind, in order to provide LAN performance despite going over WAN links. In one example a 75 KB Word memo was opened over a simulated link between Indonesia and California. It took over two minutes, while it was instantaneous with a WAN accelerator. The I/O block size with SMB is 64 KB so they could have got the entire file in two reads, but didn't.
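The arithmetic behind "two reads" can be sketched as follows. The round-trip time is an assumed figure for illustration only, not a measurement from that test.

```python
def min_reads(file_size, block_size):
    """Smallest number of block-sized reads that covers the whole file."""
    return -(-file_size // block_size)  # ceiling division on integers

def serial_time(requests, round_trip_s):
    """Elapsed seconds when each request waits for the previous reply."""
    return requests * round_trip_s

KB = 1024
ASSUMED_RTT_S = 0.2  # assumption: illustrative long-haul round-trip time

reads = min_reads(75 * KB, 64 * KB)
print(reads, serial_time(reads, ASSUMED_RTT_S))
```

Under that assumed round trip, two minutes of elapsed time corresponds to several hundred requests issued one after another, which is the kind of chattiness described above.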

If anyone is curious about what it was like writing a SMB server in the second half of the nineties I wrote about it at http://www.rogerbinns.com/visionfs.html

michael_miller 13 years ago | |

Do you know the cause of the 3k requests which Vista made? Do you have a sane theory why these were occurring? Also, do you have any suggestions for better clients to use?

rogerbinns 13 years ago | | |

> Do you know the cause of the 3k requests which Vista made? Do you have a sane theory why these were occurring?

Backwards compatibility and layers of indirection.

Microsoft has always made great efforts for backwards compatibility - Raymond Chen's blog is a good source of stories. Quite simply if you upgraded Windows and apps stopped working then you'd blame Windows. Of course it is almost always the apps relying on undocumented behaviour, ignoring documentation, relying on implementation artifacts etc. This means a lot of code to detect and work around problems in other components. For a networked filesystem client the simplest way is sending lots of requests and picking results of interest based on what comes back. Networked filesystem servers also work around client problems in various ways - eg they may return smaller block sizes than the client requested because it is known to have occasional problems. All of this builds up layers and layers of workarounds, workarounds to workarounds, having to test against OS/2 etc. SMB2 was an attempt to wipe the slate clean (no more OS/2!) but of course the crud starts building up again.

Explorer isn't a program that displays files and directories despite appearances. There are layers and layers of abstractions, parts provided by COM etc. The code that knows it wants to display the listing of a zip file is many layers away from the code that generates network requests. It is always easier to write code that does more than strictly needed than the absolute minimum necessary.

frozenport 13 years ago | |

Isn't Riverbed's entire company [1] founded on Microsoft protocol inefficiency?

[1] http://www.riverbed.com/

rogerbinns 13 years ago | | |

Time to air some dirty laundry. I worked for one of their competitors - Riverbed was set up after we were successful with the intention of beating us. (They eventually did mainly because we were acquired by a big company who essentially threw away the $300m they spent on us.)

But Riverbed's SMB implementation was done by people who didn't understand it, and who had a dangerous attitude. Essentially a WAN optimizer is looking at commands and responses going by and doing a beneficial man in the middle attack based on that data. One technical issue is to decide how you handle the unknown - eg a client or server speaking a dialect you haven't tested, or a command you haven't seen before/developed support for. Our attitude was always that it invalidated any caches, and worst case would disable acceleration on that connection. Riverbed just let it fly by.

An example of how that breaks things is that there is something similar to an ioctl to set ranges of a file to be zeroed out. Riverbed didn't know about that, and would keep returning the old cached contents. Similarly they didn't know about alternate data streams, and especially how they are named which breaks a naive filename caching implementation. At one point I sat down and came up with 5 separate demonstrations of how Riverbed corrupt data (ie 5 different areas of the protocol they messed up). The first one got published and Riverbed threatened to sue, as there was some Oracle inspired clause in their legal agreements! Our lawyers were chickens and that was the last of it.
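A minimal sketch of the conservative policy described above: serve cached data only for commands the proxy understands, and treat anything unknown as a reason to drop the cache and stop accelerating that connection. The command names, the `FakeServer` class and the `zero_range` operation are hypothetical stand-ins, not real SMB commands or any vendor's code.

```python
KNOWN_READS = {"read"}
KNOWN_WRITES = {"write"}

class ConservativeCache:
    """Caching man-in-the-middle for a single connection."""

    def __init__(self, backend):
        self.backend = backend    # callable(command, path, *args) -> reply
        self.cache = {}
        self.accelerating = True

    def handle(self, command, path, *args):
        if command in KNOWN_READS and self.accelerating:
            key = (path, args)
            if key not in self.cache:
                self.cache[key] = self.backend(command, path, *args)
            return self.cache[key]
        if command in KNOWN_WRITES:
            # A known write invalidates everything cached for that path.
            self.cache = {k: v for k, v in self.cache.items() if k[0] != path}
        elif command not in KNOWN_READS:
            # Unknown command: we cannot know what it changed, so drop all
            # cached data and stop accelerating this connection.
            self.cache.clear()
            self.accelerating = False
        return self.backend(command, path, *args)

class FakeServer:
    """Hypothetical file server used only to exercise the sketch."""

    def __init__(self):
        self.files = {"memo.doc": b"old contents"}

    def __call__(self, command, path, *args):
        if command == "read":
            return self.files[path]
        if command == "write":
            self.files[path] = args[0]
            return b""
        if command == "zero_range":  # stand-in for the ioctl-like call
            self.files[path] = b"\0" * len(self.files[path])
            return b""
        raise ValueError(command)

server = FakeServer()
proxy = ConservativeCache(server)
first = proxy.handle("read", "memo.doc")
proxy.handle("zero_range", "memo.doc")   # not in either known set
second = proxy.handle("read", "memo.doc")
```

A proxy that let the unknown command "fly by" without touching its cache would return the stale `first` contents for the second read.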

My own view is that customer data is sacrosanct and I made sure we always did the right thing. They played fast and loose. However most people would blame Microsoft if there are issues rather than realising it was Riverbed's attitude causing corruption.

Riverbed did many other things right. They didn't get acquired like most in the industry, so they didn't have to deal with being squelched by an acquirer. Their marketing focussed a lot on the low end - when people already have two devices they are likely to buy more of the same (sunk cost fallacy). And they did TCP only (we did IP and TCP). TCP only makes it far easier to configure, load balance and do auto-discovery.

onedognight 13 years ago |

> Time Machine, only works over a LAN with destinations that support AFP. This is at least in part because of Time Machine's reliance on Unix hard links, and also in part because it has to be able to ensure that any OS X files with HFS+ specific metadata are correctly preserved.

This is not the reason. Time Machine does support hard links, legacy Mac metadata, and other Unix features. It does this by writing all the data into large blobs (a sparse bundle) with an embedded filesystem of its choosing (i.e. HFS+). It can use any destination filesystem for the blobs, including FAT.

__david__ 13 years ago | |

In particular, Time Machine makes large use of hard links to directories, which not many filesystems support. With HFS+ Apple can be sure that support is always there.

deathcakes 13 years ago | | |

Actually I think you'll find it makes use of hard links to files. It's basically a reimplementation of rdiff-backup, or it might be the other way round. I can assure you that no directories get hard linked, and I'm sure someone will furnish the obligatory xkcd.

Edit -- I stand corrected! It does in fact link folders as well. Also: http://xkcd.com/981/
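For anyone unfamiliar with the technique, here is a minimal sketch of a snapshot backup that hard-links unchanged files to the previous snapshot. It only illustrates hard links to files and is not Time Machine's actual implementation; as noted above, few filesystems support hard links to directories. The function and directory names are made up for the example.

```python
import filecmp
import os
import shutil
import tempfile

def snapshot(source, previous, target):
    """Copy source into target, hard-linking files unchanged since previous."""
    for dirpath, _dirnames, filenames in os.walk(source):
        rel = os.path.relpath(dirpath, source)
        os.makedirs(os.path.normpath(os.path.join(target, rel)), exist_ok=True)
        for name in filenames:
            src = os.path.join(dirpath, name)
            dst = os.path.normpath(os.path.join(target, rel, name))
            old = os.path.join(previous, rel, name) if previous else None
            if old and os.path.isfile(old) and filecmp.cmp(src, old, shallow=False):
                os.link(old, dst)       # unchanged: share data with old snapshot
            else:
                shutil.copy2(src, dst)  # new or changed: store a fresh copy

# Usage: two snapshots of a directory where only one file changes.
root = tempfile.mkdtemp()
src = os.path.join(root, "src")
snap1 = os.path.join(root, "snap1")
snap2 = os.path.join(root, "snap2")
os.makedirs(src)
for name, text in (("a.txt", "unchanged"), ("b.txt", "version 1")):
    with open(os.path.join(src, name), "w") as f:
        f.write(text)
snapshot(src, None, snap1)
with open(os.path.join(src, "b.txt"), "w") as f:
    f.write("version 2")
snapshot(src, snap1, snap2)
```

Each snapshot looks like a full copy of the tree, but an unchanged file occupies disk space only once.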

randomdata 13 years ago | |

I even remember using Time Machine with an SMB share in Tiger. You just had to enable a configuration option to make it work. Did later versions of OS X break that functionality?

cortesoft 13 years ago | | |

You still can, but there are a few more workarounds you have to do now. You have to manually create the sparsebundle AND change configuration options.

r00fus 13 years ago | |

By "large blobs" you mean sparsebundles, right? Sparsebundles (as opposed to disk images) can be diff'd, allowing Time Machine to not only treat them as an HFS capable FS, but to isolate changes to a single block, reducing network traffic and time to backup.

glhaynes 13 years ago | | |

Sparsebundles aren't diffed in Time Machine (what would they diff against?); and backups are only done in whole-file increments, not at a block level.

apitaru 13 years ago |

Finally. Someone at Apple must be a Bukowski fan. I'm reminded of his poem "16 Bit Intel 8088 Chip" (not his greatest, but suitable):

http://bukowskiforum.com/threads/16-bit-intel-8088-chip.2791...

cpach 13 years ago | |

I had never heard of that poem before, I think it’s awesome :)

zwieback 13 years ago |

Back in the early nineties I worked at Miramar Systems on an AFP server and actually a full AppleTalk stack that ran on Windows 3.11 (VxDs!) and OS/2. Macs could run full AFP and whatever the printer protocol was called to a network of PCs.

IBM sold a version of our stuff that was called LanServer for Macintosh so back then Macs and AFP were covered!

It was quite a popular product at the time. Although I never enjoyed working on Macs I thought that AFP was pretty cool. We all had "Inside AppleTalk" pretty much memorised - what a great book.

codex 13 years ago |

I would have preferred NFSv4 over SMB2. They are quite similar technically, but the former has less chance of veering off into supporting strange Windowsisms which will be hard to translate to a POSIX client. That said, SMB2 is widely deployed and Microsoft is innovating in SMB faster than NFS is improving.

Fortunately OS X does not use Samba as their SMB2 client.

mhurron 13 years ago | |

Most users are going to have a Mac and a Windows machine, so SMB makes far more sense. You're going to see NFS in enterprise situations, and Apple does not really target that market.

mitchty 13 years ago | | |

Not only that but nfsv4 has its own issues in regards to userid/gid mapping.

Setting that up with kerberos is... not fun (speaking from the solaris/linux/aix side of things).

Not that smb would be better for most unixes mind you, just that nfsv4 is its own version of hell in some ways.

velodrome 13 years ago |

This is great. I can finally interoperate with linux and windows.

Every time I connect with AFP, my CPU would spike to 100% under Ubuntu.

lysol 13 years ago | |

This isn't new functionality, it's just that SMB2 is now the default.

vvhn 13 years ago | | |

from a functionality point of view yes - you have always been able to connect to a Linux/BSD box using NFS, SMB(1) ( if samba is installed on them ) or AFP ( if netatalk is installed on them ).

What this is saying is that SMB2 support has been added (since Mountain Lion does not support SMB2) and seems to have been simultaneously made the default for connecting to servers that support it (hopefully only if you don't explicitly specify what to use, and presumably the SMB server on OS X Mavericks does).

velodrome 13 years ago | | |

I heard it was kind of buggy in 10.8.x.

inthewind 13 years ago |

Can someone chime in with the pros and cons of each network filesystem? And which is a good fit for Linux - or rather for those OSs that don't need to cooperate with Windows? Was NFS ever updated - or replaced? How much of SMB is now open after court rulings? And is there one that is technically better than another?

wazoox 13 years ago | |

My experience with Macs connecting to Linux file servers is that OS X NFS client performance is fine (110 MB/s over gigabit Ethernet), and SMB performance sucks badly (70 MB/s on GigE, comparable to the poor old cranky Windows XP). AFP performance with netatalk is comparable to NFS, but much more resource intensive on the server. Therefore I always use NFS shares between Linux and Macs.

inthewind 13 years ago | | |

Thanks for the reply. How sucky is something like an SSH mount with fuse, or is that not comparable?

astrodust 13 years ago | |

NFS was never especially good; many a system admin would spend hours upon hours trying to fix it when it malfunctioned for no specific reason, but it was the only viable option for many years. The alternatives were either research projects or proprietary protocols like the ones Novell used.

SMB isn't so much better as more widely supported.

lmm 13 years ago | | |

SMB at least avoids the stupid NFS behaviour where any program that tries to read while the server's offline uninterruptibly hangs.

nvr219 13 years ago | |

Use ReiserFS

inthewind 13 years ago | | |

Is ReiserFS a network file system?

Demiurge 13 years ago | | |

I heard it's a killer.

sytelus 13 years ago |

OS X's interoperability with PCs is actually more badly broken than this. This is mind-boggling because if Apple could get this one thing right, more people would be willing to buy a Mac Mini and put it on their home network. I recently tried to use an external device full of NTFS-formatted hard drives on a Mac Mini. The first thing I discovered was that OS X can't natively write to NTFS-formatted drives. Even after you discover and purchase 3rd party apps that enable writing to NTFS-formatted volumes, OS X can't share them via SMB. This is because Apple's own SMB implementation that they tried to replace is broken. So you have to disable that and install open source SMB anyway. There are quite a few hoops to jump through to accomplish this.

So there is no built-in way to share external drives connected to a Mac Mini on the network if they are NTFS formatted.

shinratdr 13 years ago |

I'm hoping this results in vastly improved SMB support, which, as I fully agree with other commenters, has been infuriating since Apple decided to roll their own. I frequently hop to my Windows machine to manage my Windows Home Server even though I'm just doing simple SMB communication and file cleanups that should work fine in OS X, but don't.

polshaw 13 years ago |

Related: I take it there is no maintained open source SMB server that isn't GPL3 these days? Sucks since Apple abandoned samba2. How stupid would it be to use Apple's old samba2 for an appliance? (guess: very?)

lmm 13 years ago | |

Can you not just install modern samba yourself?

icebraining 13 years ago | | |

I'd guess polshaw wants to sell/distribute an appliance containing proprietary software (hence the reluctance to use a GPLv3 licensed component), not just install it on his own device.