<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>GeeForce LLC &#187; SAN</title>
	<atom:link href="http://www.geeforce.net/tag/san/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.geeforce.net</link>
	<description>We get technology out of the way of doing business</description>
	<lastBuildDate>Thu, 28 Oct 2010 14:15:17 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
	<atom:link rel='hub' href='http://www.geeforce.net/?pushpress=hub'/>
		<item>
		<title>A Scalable E-Mail Architecture</title>
		<link>http://www.geeforce.net/2010/03/e-mail-architecture-2/</link>
		<comments>http://www.geeforce.net/2010/03/e-mail-architecture-2/#comments</comments>
		<pubDate>Wed, 03 Mar 2010 00:31:55 +0000</pubDate>
		<dc:creator>aaron_gee</dc:creator>
				<category><![CDATA[Clients]]></category>
		<category><![CDATA[Hardware]]></category>
		<category><![CDATA[Latest]]></category>
		<category><![CDATA[Networks]]></category>
		<category><![CDATA[Open Source]]></category>
		<category><![CDATA[Software]]></category>
		<category><![CDATA[Document Management]]></category>
		<category><![CDATA[Email]]></category>
		<category><![CDATA[Qmail]]></category>
		<category><![CDATA[SAN]]></category>
		<category><![CDATA[Scalability]]></category>
		<category><![CDATA[single sign in]]></category>
		<category><![CDATA[tiered storage]]></category>

		<guid isPermaLink="false">http://www.geeforce.net/?p=223</guid>
		<description><![CDATA[According to a study done last year by Forrester Research, nearly half of large enterprises are &#8220;evaluating alternative options for managing and providing email&#8221;.  Why?  It&#8217;s relatively easy to build a highly available, highly redundant email system that can support tens or hundreds of thousands of users with free software. The answer to the &#8220;why&#8221; is [...]]]></description>
			<content:encoded><![CDATA[<p>According to <a title="Forrester Study Touted by Google" href="http://www.google.com/a/help/intl/en/admins/pdf/forrester_cloud_email_infrastructure_and_operations_analysis.pdf" target="_blank">a study</a> done last year by Forrester Research, nearly half of large enterprises are &#8220;evaluating alternative options for managing and providing email&#8221;.  Why?  It&#8217;s relatively easy to build a highly available, highly redundant email system that can support tens or hundreds of thousands of users with free software. <a href="http://www.geeforce.net/wp-content/uploads/2010/03/Single_Server_Solution1.png"><img class="alignright size-medium wp-image-250" title="Single_Server_Solution" src="http://www.geeforce.net/wp-content/uploads/2010/03/Single_Server_Solution1-300x219.png" alt="" width="300" height="219" /></a>The answer to the &#8220;why&#8221; is a bit complex and different for every company, but the leading cause of email headaches is poor architecture.  Most corporate email systems evolved from a single box.  In a lot of SMEs there is only &#8220;the mail server&#8221;. That idea, coupled with proprietary software, has led a lot of companies down an unsustainable email path.</p>
<p>A lot of email problems simply go away if the system architecture has been well designed.  The architecture that we lay out here took into consideration ease of email management, high availability, storage growth, data retention, and retrieval.  It is based on open source software, but the ideas and architecture can be applied to proprietary solutions with some modifications.</p>
<p>The analysis of this email problem started by breaking out each action of a typical email transaction (delivery, management, and retrieval) into very specific tasks and then deciding, based on our requirements, where those tasks belong.  We try to push task intelligence to the parts of this clustered design where it makes the most sense and provides the most benefit.  The key here was to never create a single point of failure and to architect the design so that each task can be scaled separately from the other tasks.  That way adding another layer of spam protection doesn&#8217;t require a total redesign.</p>
<p>Our solution creates four zones:</p>
<ol>
<li>Inbound Zone (SMTP servers facing the Internet)</li>
<li>Storage Zone (Mail delivery and SAN)</li>
<li>Client Zone (Webmail &amp; IMAP servers for client access and outbound SMTP servers)</li>
<li>Business Intelligence Zone (Archival, Tiered Storage Decisions, Company Wide Searches)</li>
</ol>
<h3>Common Data Between Zones</h3>
<p>Some elements of your email infrastructure, such as valid usernames, must be understood across all zones, while other information, such as passwords or mailbox locations, only needs to be known by some of the zones.  The user information can be stored in a SQL or LDAP server and replicated to each zone.  The data stored in SQL or LDAP can also be used for other applications not related to mail, such as user authentication, instant messaging, and billing.  In some enterprises this requires the user SQL/LDAP layer to be pulled out into its own environment; in others it requires a hybrid LDAP/SQL solution.  In our sample architecture the system in question relied on <a title="MySQL, the world&#8217;s most popular open source SQL database" href="http://mysql.com/" target="_blank">MySQL</a>, and replication was used to give each machine a local SQL store.</p>
<h3><strong>Zone 1</strong>: Inbound</h3>
<p>Inbound mail servers are defined in a domain&#8217;s DNS (as MX records), and it&#8217;s simple to designate multiple inbound servers.  In the classic single box solution, there is only one inbound server.  The single server has to handle all inbound connections, all filtering, the mail store, and client connections. When the single server is flooded with traffic, that traffic eats up resources and ruins the end user&#8217;s email experience.  In the properly architected solution the load of incoming traffic is spread out among multiple servers that can be geographically diverse.</p>
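<p>As a rough illustration of how that spreading works, the sketch below orders a domain&#8217;s MX records the way a sending server is expected to try them: lowest preference value first, with hosts that share a preference tried in random order.  The host names and preference values are hypothetical and are not taken from the system described here.</p>

```python
import random


def order_mx_hosts(mx_records, rng=None):
    """Return MX host names in the order a sending server would try them.

    mx_records is a list of (preference, host) tuples. Lower preference
    values are tried first; hosts sharing a preference are shuffled so
    that load spreads across them.
    """
    rng = rng or random.Random()
    by_preference = {}
    for preference, host in mx_records:
        by_preference.setdefault(preference, []).append(host)

    ordered = []
    for preference in sorted(by_preference):
        group = by_preference[preference]
        rng.shuffle(group)  # equal-preference hosts share the load
        ordered.extend(group)
    return ordered


# Hypothetical records: two primary sites and one fallback.
EXAMPLE_MX = [
    (10, "mx1.east.example.com"),
    (10, "mx2.west.example.com"),
    (20, "mx-backup.example.com"),
]

if __name__ == "__main__":
    print(order_mx_hosts(EXAMPLE_MX))
```

<p>Giving several inbound servers the same preference is what spreads the load; a higher preference value marks a server that is only used when the others cannot be reached.</p>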
<p>The inbound servers are also the first line of defense against unwanted mail.  The ideal is to prevent all suspect mail from ever making it into the mail infrastructure.  Why waste end-user CPU cycles or mail storage on spam or virus emails?  In this configuration the inbound servers protect the mail store from unnecessary email traffic. After processing the accepted mail, the inbound servers hand it off to the mail store over a private network, delivering messages via QMQP or SMTP.  This adds another layer of protection: the mail delivery servers can throttle those connections to protect the mail store, which lets the Zone 1 servers act as a buffer during extreme traffic conditions.</p>
<h4><span style="text-decoration: underline;">Zone 1 features:</span></h4>
<ul>
<li>Inbound servers have their own mail queue so that they can store mail if Zone 2 goes offline for any reason</li>
<li>Inbound servers make decisions on accepting connectivity via real time black lists (RBL)</li>
<li>Inbound servers make decisions on accepting mail for users during the SMTP transaction (don&#8217;t accept mail that has to be bounced later)</li>
<li>Inbound servers handle spam and virus tagging before handing messages to Zone 2</li>
<li>Virus &amp; spam analysis can be offloaded to other servers if the load is too high on the inbound servers, so additional capacity is as easy as adding more machines (virtual or otherwise) to the zone.</li>
</ul>
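<p>The two accept-or-reject decisions in the list above can be sketched roughly as follows.  This is a minimal illustration and not the configuration of the system described here: the blacklist zone <code>rbl.example.org</code>, the function names, and the response texts are all hypothetical, and the DNS lookup is replaced by a callable so the example runs without network access.</p>

```python
import ipaddress


def dnsbl_query_name(client_ip, zone):
    """Build the DNS name used to look up an IPv4 address in a blacklist zone.

    The octets are reversed and the zone is appended, so 192.0.2.10
    becomes 10.2.0.192.<zone>.
    """
    octets = str(ipaddress.IPv4Address(client_ip)).split(".")
    return ".".join(reversed(octets)) + "." + zone


def smtp_decision(client_ip, recipient, valid_users, is_listed):
    """Return an (SMTP code, text) pair for one recipient.

    is_listed is a callable that takes a blacklist query name and reports
    whether the list has an entry for it. A real server would perform a
    DNS lookup here; this sketch keeps it pluggable.
    """
    # Hypothetical zone name, used for illustration only.
    if is_listed(dnsbl_query_name(client_ip, "rbl.example.org")):
        return 554, "rejected: sending host is blacklisted"
    # Assumption: addresses are compared case-insensitively.
    if recipient.lower() not in valid_users:
        return 550, "no such user here"
    return 250, "ok"
```

<p>Rejecting an unknown recipient during the SMTP transaction leaves the bounce with the sending server, so the inbound servers never have to queue mail they would only bounce later.</p>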
<h3>Zone 2: Storage</h3>
<p>The mail store consists of 2 parts: the delivery machines and the storage area network (SAN).  The delivery machines receive email from Zone 1 and store it on the SAN, following any user-specific delivery rules.  Unlike in other systems, the mail sorting is done during delivery.  This reduces the number of times a message &#8220;moves&#8221; around on the file system, and requires less handling. Both front ends mount the same SAN share using a distributed file system (<a title="Wiki entry for GFS (Global File System)" href="http://en.wikipedia.org/wiki/Global_File_System" target="_blank">gfs2</a>).</p>
<p>In our system the delivery machines were also the master SQL servers, running master/master replication with each other and master/slave replication to the other zones.  All user updates, adds, and deletes are managed via a web interface attached to the SQL servers in Zone 2.  All of the Zone 1 machines were pointed to a single IP, and the two delivery machines run in high availability mode with load balancing.</p>
<h4><span style="text-decoration: underline;">Zone 2 features:</span></h4>
<ul>
<li>Storage growth is handled by the SAN &amp; choice of file system.  Simply add more storage and then <a title="Redhat manual for managing GFS file system" href="http://www.redhat.com/docs/manuals/csgfs/admin-guide/s1-manage-growfs.html" target="_blank">grow the file system</a>.</li>
<li>Tiered Storage can be provided by multiple SANs.  A high performance SAN for recent email and a slower but larger SAN for archival purposes.</li>
<li>Delivery rules are stored and executed during the first delivery.</li>
<li>Delivery can be scaled by adding front ends to either a common distributed backend storage or multiple common backends.</li>
<li>The SAN is fully mirrored.  Should the primary SAN fail the backup SAN comes online automatically.  File system mirroring is handled at the SAN level.</li>
<li>Since each client&#8217;s mail store location is kept in a SQL server, migration from one SAN to another can be done &#8220;online&#8221; with no downtime.</li>
</ul>
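<p>The last point can be sketched as a lookup table plus a &#8220;copy first, repoint second&#8221; migration step.  This is only an illustration under stated assumptions: SQLite stands in for the MySQL store so the example is self-contained, and the table name, column names, and paths are hypothetical.  A production migration would also have to deal with mail that arrives while the copy is running.</p>

```python
import sqlite3


def open_user_db():
    """Create an in-memory stand-in for the replicated user database."""
    db = sqlite3.connect(":memory:")
    db.execute(
        "CREATE TABLE mailboxes (address TEXT PRIMARY KEY, maildir TEXT NOT NULL)"
    )
    return db


def mailbox_path(db, address):
    """Look up where a user's mail lives, or return None if unknown."""
    row = db.execute(
        "SELECT maildir FROM mailboxes WHERE address = ?", (address,)
    ).fetchone()
    return row[0] if row else None


def migrate_mailbox(db, address, new_maildir, copy_mail):
    """Move one mailbox to another SAN, then repoint the lookup.

    copy_mail(old, new) is a caller-supplied function that copies the
    messages. The database row changes only after the copy returns, so
    readers keep using the old location until the new one is ready.
    """
    old_maildir = mailbox_path(db, address)
    if old_maildir is None:
        raise KeyError(address)
    copy_mail(old_maildir, new_maildir)
    with db:  # commits on success, rolls back if the update fails
        db.execute(
            "UPDATE mailboxes SET maildir = ? WHERE address = ?",
            (new_maildir, address),
        )
    return old_maildir
```

<p>Because delivery and client access both ask the database where a mailbox lives, changing one row is enough to send them to the new SAN.</p>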
<div id="attachment_258" class="wp-caption alignleft" style="width: 227px"><a href="http://www.geeforce.net/wp-content/uploads/2010/03/Distributed_Architecture1.png"><img class="size-medium wp-image-258" title="Distributed_Architecture" src="http://www.geeforce.net/wp-content/uploads/2010/03/Distributed_Architecture1-217x300.png" alt="Distributed_Architecture" width="217" height="300" /></a><p class="wp-caption-text">Distributed Architecture</p></div>
<h3>Zone 3: Clients</h3>
<p>Zone 3 is the end user zone.  This zone takes care of webmail, SMTP relaying (outbound), and IMAP clients (Outlook &amp; smart phones).  In our configuration there are two machines that mount the same SAN and run 3 services: IMAP, HTTPS, &amp; SMTP.  The 2 servers run in load-balancing/high-availability mode.  In this case the client traffic combined with the webmail load was light enough to combine all of the client services onto each machine.  Each client service can easily be moved to its own server, providing scalability.  This zone deals entirely with internal client requests.  If a client receives, checks, or sends an email, regardless of device (laptop, phone, etc.), it goes through this zone.</p>
<h3>Zone 4: Business Intelligence</h3>
<p>This zone mounts the same SAN and handles things like auto archiving, indexing of emails for better IMAP performance, and other functions that touch your email but whose primary function ISN&#8217;T email.  Email management tools live in this zone (Web based in this case). The advantage of having a dedicated business intelligence zone is that it provides application-specific functionality and connectivity without adding to the performance requirements of any one specific area of typical email transactions.</p>
<p>Examples of good uses of Zone 4 include document management software that indexes company-wide emails.  This type of indexing becomes invaluable when discovery orders are issued or an executive leaves under dubious circumstances.  Custom reports on email usage and quotas, organized across corporate divisions, enable IT to make rational choices on where resources will be best spent.  This zone is also where programs designed to automate tiered storage and auto archiving decisions need to go.</p>
<p>Having one place to go to write/execute that intelligence provides an enterprise the flexibility that it needs when addressing email-specific issues AND it does it in a way that minimally impacts email.  A perfect example of what happens when you build that intelligence into the wrong place would be an auto archive program that a certain hypothetical email admin might install for their enterprise.  The auto archiving is too aggressive in its endeavor to archive everything older than (x) days (the default setting), leading to a huge slowdown in the enterprise&#8217;s email delivery. The helpdesk phones won&#8217;t stop ringing and one can expect the fainter of heart support staff to be reduced to quivering piles of jello in a cubicle.  In the enterprise, clients get cranky when the email doesn&#8217;t work.  When things finally get caught up, the legal staff shows up on the admin&#8217;s doorstep with pitchforks and torches.  Not Good.</p>
<p>Some system architects or vendors want tiered storage or auto archiving to live on the primary mail store, or in storage.  The issue is that neither of those areas has the native intelligence to understand how users use, or are required to access, email.  It gets hard to tell your SAN which users&#8217; email folders need to be faster (for example, the CEO who refuses to archive and calls when searches take more than 5 seconds), or to have your mail server define which email documents are connected to a legal case. Business intelligence isn&#8217;t an oxymoron until your SAN decides which email is archived for you.</p>
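<p>To make the point concrete, here is a minimal sketch of the kind of rule that is easy to express in Zone 4 and hard to express inside a SAN or mail server.  Everything in it is an assumption made for illustration: the <code>Message</code> fields, the tier names, the one-year threshold, and the choice to keep legal-hold mail on the fast tier.</p>

```python
from dataclasses import dataclass
from datetime import date


@dataclass
class Message:
    owner: str
    received: date
    legal_hold: bool = False  # flagged as connected to a legal case


def choose_tier(message, today, keep_fast_owners, archive_after_days=365):
    """Pick a storage tier ("fast" or "archive") for one message.

    The rules use business knowledge that the storage layer does not
    have: legal holds and per-user exceptions come before message age.
    """
    if message.legal_hold:
        return "fast"
    if message.owner in keep_fast_owners:
        return "fast"
    age_days = (today - message.received).days
    return "archive" if age_days > archive_after_days else "fast"
```

<p>A job in Zone 4 could run a rule like this on its own schedule and at its own pace, so tuning the policy never competes with mail delivery.</p>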
<p>Design your business intelligence where it belongs, and where you can react quickly without impacting the primary function of your email system, which is to deliver mail.  When you tie it all together you have a low-maintenance, highly scalable email solution that a Fortune 100 company would be proud of.  All it took was a little bit of up-front thought to design the proper architecture.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.geeforce.net/2010/03/e-mail-architecture-2/feed/</wfw:commentRss>
		<slash:comments>7</slash:comments>
		</item>
		<item>
		<title>CFITS &#8211; Good stuff from Xiotech &amp; Cisco</title>
		<link>http://www.geeforce.net/2010/02/cfits-good-stuff-from-xiotech-cisco/</link>
		<comments>http://www.geeforce.net/2010/02/cfits-good-stuff-from-xiotech-cisco/#comments</comments>
		<pubDate>Fri, 26 Feb 2010 20:24:17 +0000</pubDate>
		<dc:creator>aaron_gee</dc:creator>
				<category><![CDATA[Hardware]]></category>
		<category><![CDATA[Latest]]></category>
		<category><![CDATA[Networks]]></category>
		<category><![CDATA[Software]]></category>
		<category><![CDATA[CFITS]]></category>
		<category><![CDATA[Cisco]]></category>
		<category><![CDATA[NAS]]></category>
		<category><![CDATA[Network]]></category>
		<category><![CDATA[SAN]]></category>
		<category><![CDATA[UCS]]></category>
		<category><![CDATA[Virtualization]]></category>
		<category><![CDATA[Xiotech]]></category>

		<guid isPermaLink="false">http://www.geeforce.net/?p=165</guid>
		<description><![CDATA[Yesterday I had the pleasure of being one of four speakers at the network storage event sponsored by CFITS (Central Florida Information Technology Society).  All four presentations have been put together in a single PowerPoint that is posted on the CFITS website.  I&#8217;ve included a Flash version of just the GeeForce slides here.  [...]]]></description>
			<content:encoded><![CDATA[<p>Yesterday I had the pleasure of being one of four speakers at the network storage event sponsored by CFITS (<a title="Central Florida Information Technology Society" href="http://cfits.org/" target="_blank">Central Florida Information Technology Society</a>).  All four presentations have been put together in <a title="CFITS Combined Network Storage Presentation" href="http://cfits.org/images/presentations/2010.storage/data.storage.02.25.10.pptx" target="_blank">a single PowerPoint</a> that is posted on the CFITS website.  I&#8217;ve included a Flash version of just the GeeForce slides <a title="CFITS GeeForce Network Storage Presentation" href="http://www.geeforce.net/wp-content/uploads/2010/02/Network_Storage.swf" target="_blank">here</a>.  The great thing about events put on by CFITS is that they attract some really bright people and top tier vendors.  While all of the presentations were good, two stuck out in my mind.</p>
<h4><strong>Xiotech gets storage</strong></h4>
<p>I enjoyed the <a title="Xiotech Company Website" href="http://www.xiotech.com/" target="_blank">Xiotech</a> presentation by <a title="Peter Selin's Blog" href="http://peterselin.sys-con.com/">Peter Selin</a>, whose presentation followed mine.  His emphasis on true Total Cost of Ownership (TCO) calculations and understanding how the applications use storage dovetailed very nicely with points I had made earlier.  Xiotech&#8217;s presentation went a step further and went into application tuning and how that affects storage performance.  A good part of the presentation was &#8220;SSD facts or fiction&#8221;.  There was an enlightening graph of SSD (solid-state drive) sustained IOPS vs. time.  This was nothing new for those of us with SSD server experience, but an eye opener for a lot of people in the room.</p>
<p>If you&#8217;re unfamiliar with Xiotech&#8217;s concept, then now might be the time to explain what they do.  Xiotech looks at storage like a &#8220;black box&#8221;.  It doesn&#8217;t matter what&#8217;s in the box &#8211; what matters is the capacity, throughput and reliability of the data storage.  Their solutions utilize Fibre Channel and provide the foundation for a high performance SAN (Storage Area Network).   One of the most distinctive aspects is that the end user no longer worries about individual drives and data redundancy. Data redundancy is taken care of by &#8220;the box&#8221;.  To add more capacity, add another box.  This moves intelligence out of controllers and applications into storage where it belongs.  Just like intelligent networks, having the right intelligence in the right place makes a lot of sense.</p>
<p>I haven&#8217;t had a chance to test or use the products, but their architecture deserves a very close look when high performance storage is called for.  The company is talking about Fibre Channel over Ethernet in the future and I hope that they look at ATA over Ethernet (<a title="Wiki article on ATA over Ethernet" href="http://en.wikipedia.org/wiki/ATA_over_Ethernet" target="_blank">AoE</a>) as well.</p>
<h4><strong>Cisco goes after the datacenter</strong></h4>
<p>Cisco&#8217;s presentations always pique my interest.  This is a company that spends a lot of time figuring out how to produce a better mouse trap (or buying the company that has) and it shows.</p>
<p><span style="text-decoration: underline;">Network and Storage</span> Cisco&#8217;s approach is a continuation of the approach that it helped pioneer, network convergence.  Yesterday&#8217;s converging voice, video, and data via IP is passé; Cisco is now converging the SAN and LAN (Local Area Network) into a <a title="Cisco Nexus 5000 &amp; Unified Fabric" href="http://www.cisco.com/en/US/products/ps9670/index.html">unified fabric</a>.  With Fibre Channel over Ethernet the same network is used for SAN and LAN connectivity, simplifying cabling and switches.  With Cisco&#8217;s Fibre Channel/Ethernet modules for their Nexus class switches, Cisco is providing a bridge between the current SAN and LAN.  With 10GE (10 Gigabit Ethernet) networks already here and 40GE just around the corner, the writing is on the wall.  Eventually all LAN &amp; SAN traffic will be carried on the same network.  <a title="Robert Metcalfe the inventor of Ethernet" href="http://en.wikipedia.org/wiki/Robert_Metcalfe">Robert Metcalfe&#8217;s</a> invention lives eternal.</p>
<p><span style="text-decoration: underline;">Servers and Virtualization</span> This part of Cisco&#8217;s offering is where we see radical innovation. Cisco doesn&#8217;t have a history of building servers, so their approach is clean-sheet and distinct from what I&#8217;ve seen from other vendors.  What Cisco did was look at large virtualized environments holistically, not just focusing on server, storage, or network individually.  Cisco has tried to converge and unify many components of a large virtualized environment and build management into the entire environment from the get-go.  They call their approach the Unified Computing System or <a title="Cisco's Unified Computing System" href="http://www.cisco.com/en/US/netsol/ns944/index.html">UCS</a>.</p>
<p>The UCS structure combines a unified (or should we say converged?) 10GE network fabric with unique super high memory blade servers that can support up to 384 GB DDR3.  The management of the entire structure is built in.  Cisco provides for a virtualized switch within each blade, and each virtualized server can be centrally managed in its entirety.  Moving a virtual instance from one blade to another becomes simpler because the network moves with the instance and doesn&#8217;t require reprogramming the switch.  Cisco&#8217;s approach will change the entire management experience for large virtualized environments.</p>
<p>Both presentations have given me a great excuse to deep dive into the vendors&#8217; technology and applications thereof.  Cisco is showing off their C-Series servers in Orlando on March 9th (<a title="Register for March 9th 2010 Cisco Presentation" href="http://www.cisco.com/go/semreg/csfm_ciscoserver_09mar10/html" target="_blank">register for that event here</a>) and there is some <a title="Xiotech Bedtime Reading Material" href="http://xiotech.com/resource-center.php">good reading material</a> over at Xiotech.  Keeping up with new technology and its application is one of the things I enjoy most about my job.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.geeforce.net/2010/02/cfits-good-stuff-from-xiotech-cisco/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
	</channel>
</rss>

