大数据技术之高频面试题.docx (Big Data Technology: High-Frequency Interview Questions)
123 pages
2022-09-21
Contents
Chapter 1 Interview Overview
1.1 What matters most during an interview?
1.2 How should you speak during an interview?
1) Express yourself clearly
2) Make no mistakes in what you say
1.3 Interview tips
1.3.1 Six common questions
1.3.2 Two things to watch for
1.3.3 Self-introduction (keep it within four and a half minutes, no more than 5 minutes)
Chapter 2 Hand-Written Code
2.1 Bubble sort
2.2 Binary search
2.3 Quicksort
2.4 Merge sort
2.5 Binary trees in Scala
2.5.1 Binary tree concepts
2.5.2 Properties of binary trees
2.5.3 Scala implementation of a binary tree
2.6 Hand-written Spark-WordCount
2.7 Hand-written Spark program
Chapter 3 Project Architecture
3.1 Data warehouse concepts
3.2 System data flow design
3.3 Framework version selection
3.4 Server selection
1) Machine cost considerations
2) Operations and maintenance cost considerations
3.5 Cluster size
3.6 Staffing reference
3.6.1 Overall architecture
3.6.2 Your department's job levels and promotion rules
3.6.3 Staffing reference
Chapter 4 Technologies Used in the Project
4.1 Linux & Shell summary
4.1.1 Common Linux commands
4.1.2 Common Shell tools
4.2 Hadoop summary
4.2.1 Common Hadoop port numbers
4.2.2 Hadoop configuration files and a simple Hadoop cluster setup
4.2.3 HDFS read and write flows
4.2.4 MapReduce Shuffle, Hadoop […]
4.2.5 Yarn Job submission flow
4.2.6 Yarn's default scheduler, scheduler types, and the differences between them
4.2.7 Project experience: LZO compression
4.2.8 Hadoop parameter tuning
4.2.9 Project experience: benchmarking
4.2.10 Hadoop crashes
4.3 Zookeeper summary
4.3.1 Election mechanism
4.3.2 Common commands
4.4 Flume summary
4.4.1 Flume components, Put transactions, Take transactions
4.4.2 Flume interceptors
4.4.3 Can Flume lose data during collection? (Mechanisms that prevent data loss)
4.4.4 Flume memory
4.4.5 FileChannel optimization
4.4.6 HDFS Sink small-file handling
4.5 Kafka summary
4.5.1 Kafka architecture
4.5.2 Kafka stress testing
4.5.3 Number of Kafka machines
4.5.4 Kafka log retention time
4.5.5 Kafka disk size
4.5.6 Kafka monitoring
4.5.7 Kafka partition count
4.5.8 Replica count settings
4.5.9 How many Topics
4.5.10 Does Kafka lose data?
4.5.11 Kafka ISR replica synchronization queue
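[Copyright notice] This article is original content by a 墨天轮 user. When reposting, you must cite the document's source (墨天轮), the document link, the document author, and other basic information; otherwise the author and 墨天轮 reserve the right to pursue liability. If you find content on 墨天轮 that appears to be plagiarized or infringing, please email contact@modb.pro to report it and provide relevant evidence. Once verified, 墨天轮 will remove the content immediately.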
