
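Any discussion of big data analytics platforms has to start with Hadoop. Hadoop now has more than 10 years of history and a lot has changed along the way; its version has evolved from 0.x to the current 2.6. I define the years after 2012 as the post-Hadoop platform era. That does not mean Hadoop is no longer used; rather, as with NoSQL (Not Only SQL), other options have emerged to complement it. I have also written some introductory material on Hadoop on Zhihu (How to Learn Hadoop - Dong Fei's answer). To lay some groundwork, here is a brief overview of the related open-source components.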
Background
- Hadoop: an open-source data analytics platform that provides reliable storage and processing for big data (data too large for one computer to store, or for one computer to process within the required time). It is well suited to unstructured data, and its basic components include HDFS and MapReduce.
- HDFS: provides an elastic data storage system that spans servers.
- MapReduce: provides a standardized, data-locality-aware processing flow: read the data, map it (Map), shuffle it by key, then reduce it (Reduce) to produce the final output.
- Amazon Elastic MapReduce (EMR): a managed solution that runs on the web-scale infrastructure made up of Amazon Elastic Compute Cloud (EC2) and Simple Storage Service (S3). If you need one-off or infrequent big data processing, EMR may save you money. However, EMR is heavily optimized for working with data in S3, which comes with relatively high latency.
- Hadoop also includes an extended ecosystem of technologies, mainly Sqoop, Flume, Hive, Pig, Mahout, DataFu, and Hue.
- Pig: a platform for analyzing large data sets, consisting of a high-level language for expressing data analysis programs together with the infrastructure for evaluating those programs.
- Hive: a data warehouse system for Hadoop that provides a SQL-like query language, making data summarization, ad hoc queries, and analysis easy.
- HBase: a distributed, scalable big data store that supports random, real-time read/write access.
- Sqoop: a tool designed for efficient bulk data transfer between Apache Hadoop and structured data stores such as relational databases.
- Flume: a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log data.
- ZooKeeper: a centralized service for maintaining configuration information and naming, and for providing distributed synchronization and group services.
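- Cloudera: the most mature Hadoop distribution, with the largest number of deployments. It offers powerful deployment, management, and monitoring tools, and developed and contributed the Impala project for real-time processing of big data.
- Hortonworks: a vendor whose distribution is 100% open-source Apache Hadoop. It has developed many enhancements and contributed them to the core trunk, which allows Hadoop to run natively on platforms including Windows Server and Azure.
- MapR: for better performance and ease of use, it supports the native Unix file system rather than HDFS. It provides high-availability features such as snapshots, mirroring, and stateful failover. It leads the Apache Drill project, an open-source implementation of Google's Dremel that aims to run SQL-like queries for real-time processing.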
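To make the MapReduce flow from the list above (map, shuffle by key, reduce) more concrete, here is a minimal single-process sketch in Python. It is only an illustration of the idea, not Hadoop's API: the function names (map_phase, shuffle, reduce_phase, word_count) are made up for this example, and a real cluster would run these phases in parallel close to where the data lives.

```python
from collections import defaultdict


def map_phase(records, mapper):
    """Apply the mapper to every input record, yielding (key, value) pairs."""
    for record in records:
        yield from mapper(record)


def shuffle(pairs):
    """Group all values by key, as the shuffle step does between Map and Reduce."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(groups, reducer):
    """Reduce each key's list of values to a single output value."""
    return {key: reducer(key, values) for key, values in sorted(groups.items())}


def word_mapper(line):
    """Emit (word, 1) for every word in a line of text."""
    for word in line.lower().split():
        yield word, 1


def sum_reducer(key, values):
    """Sum the counts collected for one word."""
    return sum(values)


def word_count(lines):
    """Classic word count expressed as map -> shuffle -> reduce."""
    return reduce_phase(shuffle(map_phase(lines, word_mapper)), sum_reducer)


if __name__ == "__main__":
    print(word_count(["big data big", "data platform"]))
    # prints {'big': 2, 'data': 2, 'platform': 1}
```

The point of the split is that the mapper and reducer are the only job-specific parts; the framework owns reading, shuffling, and writing.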
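Principles
Data storage
Our goal is to build a system that is reliable, supports large-scale expansion, and is easy to maintain. Computers follow the principle of locality, as shown in the figure: moving from the bottom to the top, access gets faster but storage becomes more expensive.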

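Compared with memory, disks and SSDs require careful thought about data placement, because performance can vary greatly. The advantages of disk are persistence, low cost per unit, and easy backup. But as memory gets cheaper, many data sets can be placed directly in memory and distributed across machines; some of these systems are key-value based, and Memcached is used for caching. In-memory data can be made durable with battery-backed RAM, by writing ahead to a log and taking periodic snapshots, or by replicating to the memory of other machines. On restart, the previous state has to be loaded from disk or over the network. In effect, disk writes are used only for the append-only log, while reads are served directly from memory. Memory-based databases such as VoltDB and MemSQL (both relational) and RAMCloud deliver high performance and remove the earlier hassle of disk management.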
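The "write ahead to a log, snapshot periodically, read from memory" pattern described above can be sketched roughly as follows. This is a toy under stated assumptions, not how VoltDB, MemSQL, or RAMCloud are implemented: the class name DurableKV, the file names wal.log and snapshot.json, and the JSON encoding are all invented for illustration, and keys are assumed to be strings.

```python
import json
import os
import tempfile


class DurableKV:
    """In-memory key-value store made durable with an append-only log and snapshots."""

    def __init__(self, directory):
        self.log_path = os.path.join(directory, "wal.log")        # hypothetical file name
        self.snap_path = os.path.join(directory, "snapshot.json")  # hypothetical file name
        self.data = {}
        self._recover()
        self.log = open(self.log_path, "a", encoding="utf-8")

    def _recover(self):
        """On restart, load the last snapshot, then replay the log on top of it."""
        if os.path.exists(self.snap_path):
            with open(self.snap_path, encoding="utf-8") as f:
                self.data = json.load(f)
        if os.path.exists(self.log_path):
            with open(self.log_path, encoding="utf-8") as f:
                for line in f:
                    key, value = json.loads(line)
                    self.data[key] = value

    def put(self, key, value):
        """Append to the log first (write-ahead), then update memory."""
        self.log.write(json.dumps([key, value]) + "\n")
        self.log.flush()
        os.fsync(self.log.fileno())
        self.data[key] = value

    def get(self, key):
        """Reads never touch the disk."""
        return self.data.get(key)

    def snapshot(self):
        """Write the full state atomically, then start a fresh, empty log."""
        tmp_path = self.snap_path + ".tmp"
        with open(tmp_path, "w", encoding="utf-8") as f:
            json.dump(self.data, f)
            f.flush()
            os.fsync(f.fileno())
        os.replace(tmp_path, self.snap_path)
        self.log.close()
        self.log = open(self.log_path, "w", encoding="utf-8")  # truncates the old log

    def close(self):
        self.log.close()


def demo_roundtrip():
    """Write, snapshot, write again, 'restart', and read both values back."""
    with tempfile.TemporaryDirectory() as directory:
        store = DurableKV(directory)
        store.put("a", 1)
        store.snapshot()
        store.put("b", 2)
        store.close()
        restarted = DurableKV(directory)
        result = (restarted.get("a"), restarted.get("b"))
        restarted.close()
        return result
```

Because replaying a put is idempotent, a crash between writing the snapshot and truncating the log does no harm in this sketch: the old log entries are simply applied again on top of the snapshot.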

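HyperLogLog & Bloom Filter & CountMin Sketch
All three are algorithms applied to big data; the rough idea is to process the input with a set of mutually independent hash functions. HyperLogLog estimates the cardinality of a very large set (that is, how many distinct elements it contains in total) by counting hash values in buckets: it counts the run of consecutive zeros in the high bits and uses the value of the low bits to pick the bucket. A Bloom Filter, in a preprocessing phase, computes the value of every hash function for each input and marks those positions. To check whether a particular input has been seen before, you only need to check whether the positions given by the same series of hash functions are marked. A Bloom Filter can produce false positives but never false negatives. It can be viewed as a data structure for looking up whether an item is present or absent (whether its frequency is at least 1). CountMin Sketch goes one step further than the Bloom Filter: it can estimate the frequency of a given input (not just whether it is at least 1).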
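As a rough illustration of the two structures just described, here is a small Python sketch of a Bloom Filter and a CountMin Sketch. The sizes, the number of hash functions, and the trick of deriving several hash functions from salted SHA-256 digests are assumptions made for this example; production implementations pick these parameters from a target error rate and use faster hashes.

```python
import hashlib


def hash_positions(item, count, size):
    """Return `count` positions in [0, size), one per salted SHA-256 hash."""
    positions = []
    for seed in range(count):
        digest = hashlib.sha256(f"{seed}:{item}".encode("utf-8")).digest()
        positions.append(int.from_bytes(digest[:8], "big") % size)
    return positions


class BloomFilter:
    """Set membership with possible false positives but no false negatives."""

    def __init__(self, size=1024, num_hashes=4):
        self.size = size
        self.num_hashes = num_hashes
        self.bits = [False] * size

    def add(self, item):
        # Mark the position chosen by every hash function.
        for pos in hash_positions(item, self.num_hashes, self.size):
            self.bits[pos] = True

    def might_contain(self, item):
        # The item was possibly added only if every one of its positions is marked.
        return all(self.bits[pos] for pos in hash_positions(item, self.num_hashes, self.size))


class CountMinSketch:
    """Frequency estimates that can overcount but never undercount."""

    def __init__(self, width=1024, depth=4):
        self.width = width
        self.depth = depth
        self.rows = [[0] * width for _ in range(depth)]

    def add(self, item, count=1):
        # One counter per row, each chosen by a different hash function.
        for row, pos in enumerate(hash_positions(item, self.depth, self.width)):
            self.rows[row][pos] += count

    def estimate(self, item):
        # Collisions only inflate counters, so the minimum is the best estimate.
        return min(
            self.rows[row][pos]
            for row, pos in enumerate(hash_positions(item, self.depth, self.width))
        )
```

Usage is symmetric: call add for every element of the stream, then might_contain answers "seen or not" and estimate answers "roughly how many times", both in fixed memory regardless of how many elements went through.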
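CAP Theorem

Put simply, there are three properties: consistency, availability, and partition tolerance, and a system can have at most two of them. Designing different kinds of systems requires weighing these trade-offs. Distributed systems also involve many algorithms and a good deal of deep theory, for example: the Paxos algorithm (see "The Paxos distributed consensus algorithm: a tale of Zhuge Liang's reverse time travel"), the Gossip protocol (see "Cassandra study notes: the Gossip protocol"), Quorum (distributed systems), logical time, vector clocks (see "Consistency algorithms, part 4: timestamps and vector diagrams"), the Byzantine generals problem, two-phase commit, and so on. These take patience to study.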
Technology

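Depending on the latency requirements (SLA), the volume of data stored, the amount of updates, and the analysis needs, a big data processing architecture has to be designed flexibly. The figure above shows big data components across different domains.
Any discussion of big data technology has to begin with Google and its new troika: Spanner, F1, and Dremel.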
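Spanner: Google's internal database, which is highly scalable, multi-version, globally distributed, and synchronously replicated, and which supports externally consistent distributed transactions. Its design goal is to span hundreds of data centers worldwide, cover millions of servers, and hold trillions of rows! (That is Google's kind of ambition ^-^)
F1: built on top of Spanner. Beyond making use of Spanner's rich features, it also provides distributed SQL, transactionally consistent secondary indexes, and other capabilities. In the AdWords advertising business it successfully replaced the aging, manually sharded MySQL setup.
Dremel: a method for analyzing information that can run on thousands of servers. Using a SQL-like language, it processes web-scale data (on the order of petabytes) extremely quickly, completing in just a few seconds.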
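Spark

Spark was the hottest big data technology of 2014; I introduced it in "What books on Spark would you recommend? - Dong Fei's answer". Its main aim is faster data analysis based on in-memory computing. It supports graph computation, stream processing, and batch processing. Core members of the Berkeley AMP Lab left to found the company Databricks, which develops the Cloud product.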
Flink

Flink uses an approach similar to query optimization in SQL databases, which is also its main difference from the current version of Apache Spark. It can apply a global optimization plan to a query to achieve better performance.
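Kafka

In "Announcing the Confluent Platform 1.0", Kafka is described as LinkedIn's "central nervous system": it manages the flow of information converging from all the applications, and this data is processed and then distributed everywhere. Unlike traditional enterprise message queue systems, Kafka processes all the data flowing through a company in near real time, and it already underpins real-time information processing platforms at LinkedIn, Netflix, Uber, and Verizon. Kafka's advantage lies in that near-real-time capability.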
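Storm

Storm is Twitter's real-time computation framework (see "Handle Five Billion Sessions a Day in Real Time"). A stream processing framework of this kind is a distributed, highly fault-tolerant real-time computation system. Storm makes continuous stream computation easy. It is often used for real-time analytics, online machine learning, continuous computation, distributed RPC, and ETL.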
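Samza

The stream processing framework championed by LinkedIn. It has been compared on several points with similar systems such as Spark and Storm. It integrates well with Kafka, which serves as its main storage node and intermediary.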
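Lambda architecture
Nathan Marz wrote the article "How to beat the CAP theorem" and proposed the Lambda Architecture. The main idea is to keep a batch architecture for workloads with high latency but large data volume, to use a stream processing framework for data that must be handled in real time, and then to build a serving layer on top that merges the two data streams. Such a system balances real-time efficiency against the scale of batch processing. I found it eye-opening; it really is effective and has been adopted in production systems by many companies.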
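The three layers described above can be sketched in a few lines of Python. This is a deliberately tiny model with invented names (batch_view, SpeedLayer, query) and a page-view counting workload chosen for the example; in a real deployment the batch layer would be something like a Hadoop job and the speed layer a stream processor.

```python
from collections import Counter


def batch_view(master_events):
    """Batch layer: recompute page-view counts from the full event history."""
    return Counter(event["page"] for event in master_events)


class SpeedLayer:
    """Speed layer: incrementally count events not yet covered by a batch run."""

    def __init__(self):
        self.view = Counter()

    def on_event(self, event):
        self.view[event["page"]] += 1

    def reset(self):
        """Called once a new batch view has absorbed the recent events."""
        self.view.clear()


def query(page, batch, speed):
    """Serving layer: merge the batch view and the real-time view."""
    return batch.get(page, 0) + speed.view.get(page, 0)


if __name__ == "__main__":
    history = [{"page": "/a"}, {"page": "/a"}, {"page": "/b"}]
    batch = batch_view(history)          # slow, complete, recomputed periodically
    speed = SpeedLayer()
    speed.on_event({"page": "/a"})       # fast, covers only the most recent data
    print(query("/a", batch, speed))     # prints 3
```

The batch result is allowed to be stale because the speed layer fills the gap, and the speed layer is allowed to be approximate or lossy because the next batch run overwrites it.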

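Summingbird
The problem with the Lambda architecture is that two systems have to be maintained. Twitter developed Summingbird so that code is written once and runs in multiple places. It connects batch and stream processing seamlessly, reducing the conversion overhead between them by integrating the two. The figure below illustrates the system at run time.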

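NoSQL
Data was traditionally stored in tree structures (hierarchies), which make many-to-many relationships hard to express. Relational databases were created to solve that problem, but in recent years relational databases have also proven inadequate in some cases, and new NoSQL systems such as Cassandra, MongoDB, and Couchbase have emerged. NoSQL systems fall into several categories: document stores, graph databases, column stores, and key-value stores. Different systems solve different problems; there is no one-size-fits-all solution.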

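Cassandra
In a big data architecture, Cassandra's main role is to store structured data. DataStax's Cassandra is a column-oriented database that provides highly available and durable service through a distributed architecture. It supports very large clusters and offers a form of consistency called "eventual consistency", which means that at any given moment the same database entry may hold different values on different servers.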
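To show what "the same entry may hold different values on different servers" looks like in practice, here is a toy Python model of eventual consistency. It is not Cassandra's API or its actual repair protocol: the Replica class and the anti_entropy function are invented for this sketch, and the only idea borrowed is resolving conflicts by write timestamp (last write wins).

```python
class Replica:
    """One server's copy of the data: key -> (timestamp, value)."""

    def __init__(self, name):
        self.name = name
        self.store = {}

    def write(self, key, value, timestamp):
        """Accept a write only if it is newer than what this replica holds."""
        current = self.store.get(key)
        if current is None or timestamp > current[0]:
            self.store[key] = (timestamp, value)

    def read(self, key):
        entry = self.store.get(key)
        return entry[1] if entry else None


def anti_entropy(replicas):
    """Exchange entries between all replicas; the newest timestamp wins everywhere."""
    for source in replicas:
        for key, (timestamp, value) in list(source.store.items()):
            for target in replicas:
                target.write(key, value, timestamp)


if __name__ == "__main__":
    a, b = Replica("a"), Replica("b")
    a.write("user:1", "old address", timestamp=1)   # reached only replica a
    b.write("user:1", "new address", timestamp=2)   # reached only replica b
    print(a.read("user:1"), "|", b.read("user:1"))  # the replicas disagree for now
    anti_entropy([a, b])
    print(a.read("user:1"), "|", b.read("user:1"))  # both now return the newer value
```

Between the two writes and the synchronization step, a client can read either value depending on which replica it hits; that window is the price paid for staying available.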
SQL on Hadoop
The open-source community has also produced many SQL-on-Hadoop projects, which aim to compete with commercial data warehouse systems. They include Apache Hive, Spark SQL, Cloudera Impala, Hortonworks Stinger, Facebook Presto, Apache Tajo, and Apache Drill. Some are based on the design of Google Dremel.
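Impala
A new query system whose development is led by Cloudera. It provides SQL semantics and can query petabyte-scale data stored in Hadoop's HDFS and HBase. It is claimed to be 5-10 times faster than Hive, but it has recently been overshadowed by Spark, and people still lean toward the latter.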
Drill
Drill is the Apache community's open-source counterpart to Dremel: a distributed system designed for interactive analysis of large data sets.
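Druid
An open-source data store designed for real-time statistical analysis on top of large data sets. The system combines a column-oriented storage layer, a distributed shared-nothing architecture, and an advanced index structure to allow arbitrary exploratory analysis of billion-row tables within a second.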
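Berkeley Data Analytics Stack

Spark was mentioned above; the Berkeley AMP Lab has a grander blueprint, BDAS, which contains many star projects. Besides Spark, they include:
Mesos: a resource management platform for distributed environments that lets Hadoop, MPI, and Spark jobs run under unified resource management. It supports Hadoop 2.0 well, and both Twitter and Coursera use it.
Tachyon: a highly fault-tolerant distributed file system that allows files to be shared reliably at memory speed across cluster frameworks such as Spark and MapReduce. Project founder Haoyuan Li says it is currently growing very fast, even more impressively than Spark did at the same stage, and a startup, Tachyon Nexus, has already been founded.
BlinkDB: also very interesting. It is a massively parallel query engine for running interactive SQL queries on huge volumes of data. It lets users trade data accuracy for faster query response times, with accuracy kept within an allowed error range.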
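The accuracy-for-speed trade that BlinkDB offers can be illustrated with plain sampling. The sketch below is not BlinkDB's method (it is a simple uniform random sample with a normal-approximation error bound), and the function name approximate_mean and its parameters are made up for this example; it only shows why looking at a fraction of the rows can still give an answer with a stated error range.

```python
import math
import random
import statistics


def approximate_mean(values, sample_size, z=1.96, seed=0):
    """Estimate the mean from a uniform random sample.

    Returns (estimate, margin), where estimate +/- margin is an approximate
    95% confidence interval when z is 1.96.
    """
    rng = random.Random(seed)              # fixed seed so the sketch is repeatable
    sample = rng.sample(values, sample_size)
    estimate = statistics.fmean(sample)
    standard_error = statistics.stdev(sample) / math.sqrt(sample_size)
    return estimate, z * standard_error


if __name__ == "__main__":
    rows = list(range(1, 100_001))         # stand-in for a large table column
    exact = statistics.fmean(rows)
    estimate, margin = approximate_mean(rows, sample_size=1_000)
    print(f"exact={exact:.1f} estimate={estimate:.1f} +/- {margin:.1f}")
```

Here only 1% of the rows are read; a larger sample shrinks the margin roughly with the square root of the sample size, which is the knob a user turns when choosing between a faster answer and a tighter one.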
Cloudera

The classic solution from the elder statesman of Hadoop.
HDP (Hortonworks Data Platform)

The architecture proposed by Hortonworks.
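Redshift

Amazon Redshift is a version of ParAccel. It uses a massively parallel processing (MPP) architecture and is a very convenient data warehouse solution: it has a SQL interface, connects seamlessly with the various cloud services, and its standout feature is speed, with very good performance from the TB to the PB range. I use it directly in my own work. It also supports different hardware platforms; for even more speed, you can use SSDs.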
Netflix

A data processing solution built entirely on AWS.
Intel
