Split MAP and REDUCE tasks into individual mesos tasks #60

Open
tarnfeld wants to merge 20 commits into master

Conversation

tarnfeld (Member)

This commit splits out the resources for MAP and REDUCE slots into two Mesos tasks instead of one, while still using a single TaskTracker JVM. This allows the idle-slot tracking to operate on MAP and REDUCE slots individually, further increasing our ability to release idle resources faster.

This is an implementation of #47.
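A rough sketch of the launch path this implies, assuming the Mesos Java protobuf API; the helper method, the resource figures, and the variable names (trackerExecutor, trackerId, slot counts) are illustrative, not the actual code in this PR:

import java.util.Arrays;

import org.apache.mesos.SchedulerDriver;
import org.apache.mesos.Protos.*;

class SlotTaskSketch {
  // Illustrative helper for a scalar resource such as "cpus" or "mem".
  static Resource scalar(String name, double value) {
    return Resource.newBuilder()
        .setName(name)
        .setType(Value.Type.SCALAR)
        .setScalar(Value.Scalar.newBuilder().setValue(value))
        .build();
  }

  // Launch two tasks that share one ExecutorInfo: a single TaskTracker JVM
  // still runs, but MAP and REDUCE resources can be revoked independently.
  static void launchTrackerTasks(SchedulerDriver driver, Offer offer,
                                 ExecutorInfo trackerExecutor, String trackerId,
                                 int mapSlots, int reduceSlots) {
    TaskInfo mapTask = TaskInfo.newBuilder()
        .setName("task-tracker-maps")
        .setTaskId(TaskID.newBuilder().setValue(trackerId + "_maps"))
        .setSlaveId(offer.getSlaveId())
        .setExecutor(trackerExecutor)                  // shared executor
        .addResources(scalar("cpus", mapSlots * 1.0))
        .addResources(scalar("mem", mapSlots * 1024.0))
        .build();

    TaskInfo reduceTask = TaskInfo.newBuilder()
        .setName("task-tracker-reduces")
        .setTaskId(TaskID.newBuilder().setValue(trackerId + "_reduces"))
        .setSlaveId(offer.getSlaveId())
        .setExecutor(trackerExecutor)                  // same executor instance
        .addResources(scalar("cpus", reduceSlots * 1.0))
        .addResources(scalar("mem", reduceSlots * 1024.0))
        .build();

    // Later, either task can be killed on its own to free idle slots.
    driver.launchTasks(offer.getId(), Arrays.asList(mapTask, reduceTask));
  }
}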

hansbogert and others added 16 commits April 13, 2015 14:32
correct spelling mistake propertios->properties
Warning: MESOS_NATIVE_LIBRARY is deprecated, use MESOS_NATIVE_JAVA_LIBRARY instead. Future releases will not support JNI bindings via MESOS_NATIVE_LIBRARY.
Use MESOS_NATIVE_JAVA_LIBRARY  in README.md
…through tasks, then decide whether they are a managed TaskTracker

* ResourcePolicy is abstract for all intents, but it could be instantiated. Make it literally abstract
* A bunch of lint warnings removed
* Rearrange code to be easier to read -- interface implementations commented and methods in order, update some docs to JavaDoc format.
Modified `MesosScheduler.java` and `configuration.md`. Now `mapred.mesos.framework.principal`, `mapred.mesos.framework.secretfile`, `mapred.mesos.framework.user`, and `mapred.mesos.framework.name` are configurable options. Addresses issue mesos#53

Added Support for Framework Authentication

Added Framework Authentication (Issue mesos#53).
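A hedged sketch of how such options might feed into framework registration, assuming the Hadoop Configuration and Mesos Protos APIs; the defaults and the buildFrameworkInfo wrapper are assumptions rather than the exact code from these commits:

import org.apache.hadoop.conf.Configuration;
import org.apache.mesos.Protos.FrameworkInfo;

class FrameworkInfoSketch {
  static FrameworkInfo buildFrameworkInfo(Configuration conf) {
    // The new keys; the default values here are illustrative only.
    String principal = conf.get("mapred.mesos.framework.principal");
    String user = conf.get("mapred.mesos.framework.user", "");
    String name = conf.get("mapred.mesos.framework.name", "Hadoop");

    FrameworkInfo.Builder framework = FrameworkInfo.newBuilder()
        .setUser(user)   // an empty string lets Mesos pick the current user
        .setName(name);

    if (principal != null) {
      framework.setPrincipal(principal);
      // mapred.mesos.framework.secretfile points at a file whose contents
      // would be wrapped in a Credential and handed to MesosSchedulerDriver
      // when authentication is enabled.
    }
    return framework.build();
  }
}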
Previously the "idle check" would be run against all task trackers,
regardless of whether they had any jobs assigned to them. The
main MesosScheduler is responsible for cleaning up task trackers
once jobs have *finished*, so this change stops us from performing
idle checks on trackers that have no jobs.

This change fixes the observed behaviour of task trackers being killed
and respawning continuously when they're waiting for jobs (e.g. with
the min map/reduce slot config option, or the fixed resource policy).
Avoid respawning task trackers constantly when they are idle
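A simplified sketch of that guard; the trackers map, the jobs field and the checkIdleSlots method are hypothetical names used to illustrate the change, not the project's actual identifiers:

// Only consider trackers that currently have jobs assigned; trackers whose
// jobs have all finished are cleaned up by the MesosScheduler itself.
for (MesosTracker tracker : trackers.values()) {
  if (tracker.jobs.isEmpty()) {
    continue;
  }
  checkIdleSlots(tracker);  // hypothetical: revoke idle MAP/REDUCE slots
}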
This commit splits out the resources for MAP and REDUCE slots into
two Mesos tasks instead of one. This allows the idle-slot tracking
to operate on MAP and REDUCE slots individually, further increasing
our ability to release idle resources faster.
@tarnfeld (Member Author)

[screenshot]

This screenshot was taken while a job was running on a shared cluster, and it's possible to see quite clearly that some Reduce slots (from the 0th task tracker) were revoked and the resources freed, while the maps are still allocated.

@Override
public void run() {
    try {
        taskTracker.run();
Member

Interesting. It's safe to reuse the same object across different threads?

Member

I guess it was like this before, but the code was rearranged. I'll take it that it is safe.

Member Author

Yeah, I just moved the code around. I think it's safe so long as it's synchronized() properly, which I think it is. Perhaps it's worth running over all the code, making sure any access to the taskTracker field is properly thread safe.
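One way that access could be made explicitly thread safe, sketched here with hypothetical names; this shows the synchronization pattern being discussed, not the project's actual code:

private TaskTracker taskTracker;  // shared between the launcher thread
                                  // and the scheduler callback threads

public void run() {
  TaskTracker tracker;
  synchronized (this) {
    tracker = taskTracker;  // read the shared field under the lock
  }
  if (tracker != null) {
    tracker.run();          // long-running call kept outside the lock
  }
}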

@brndnmtthws (Member)

Looks pretty good to me. If you've tested it, merge away.

Might be worth bumping the version in the pom too.

@tarnfeld (Member Author)

Thanks for the quick review! I've just rolled this out on one of our clusters so I want to let things settle a bit first, and get some serious traffic through the JT/TTs before saying it's good to go.

In one of the code paths (related to flaky trackers) we were using
synchronized() in nested function calls against the same object.

This is not needed, and it causes a deadlock.
If we synchronized() against the scheduler here and grab hold of the
lock, while at the same time a callback from Mesos comes in on another
thread and also calls synchronized(), the Mesos scheduler driver locks
up because it is single threaded.

In the event that the former then decides to kill a mesos task, we'll see
a deadlock because the killTask() message can't be sent while the driver
is waiting on another callback.
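A simplified reconstruction of the pattern described above, with hypothetical class, method and field names; the fix in this commit amounts to dropping the redundant inner synchronized so the scheduler lock is not held while waiting on the driver:

import org.apache.mesos.SchedulerDriver;
import org.apache.mesos.Protos.TaskID;
import org.apache.mesos.Protos.TaskStatus;

class DeadlockSketch {
  private SchedulerDriver driver;

  // Thread A: scheduler bookkeeping that decides a tracker is flaky.
  void handleFlakyTracker(TaskID trackerTask) {
    synchronized (this) {
      // ... mark the tracker as flaky ...
      killTracker(trackerTask);       // nested call below
    }
  }

  void killTracker(TaskID trackerTask) {
    synchronized (this) {             // redundant: thread A already holds the lock
      driver.killTask(trackerTask);   // per the commit message, this message can't
                                      // be sent while the single-threaded driver is
                                      // stuck waiting to enter the callback below
    }
  }

  // Thread B: the Mesos driver's single callback thread.
  public void statusUpdate(SchedulerDriver d, TaskStatus status) {
    synchronized (this) {             // blocks behind thread A, which in turn
      // ...                          // waits on the driver => deadlock
    }
  }
}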